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Abstract. This paper calculates the function c(a) whose value at a is the infimum 
of the size of a ball that contains a symplectic image of the ellipsoid E(l,a). (Here 
a > 1 is the ratio of the area of the large axis to that of the smaller axis.) The 
structure of the graph of c(a) is surprisingly rich. The volume constraint implies 
that c(a) is always greater than or equal to the square root of a, and it is not hard 
to see that this is equality for large a. However, for a less than the fourth power 
t 4 of the golden ratio, c(a) is piecewise linear, with graph that alternately lies on 
a line through the origin and is horizontal. We prove this by showing that there 
are exceptional curves in blow ups of the complex projective plane whose homology 
classes are given by the continued fraction expansions of ratios of Fibonacci numbers. 
On the interval [t 4 , 7] we find c(a) = s ^-. For a > 7, the function c(a) coincides 
with the square root except on a finite number of intervals where it is again piecewise 
linear. The embedding constraints coming from embedded contact homology give 
rise to another capacity function cech which may be computed by counting lattice 
points in appropriate right angled triangles. According to Hutchings and Taubes, the 
functorial properties of embedded contact homology imply that cech(o) < c(o) for 
all a. We show here that cech{o) > c(a) for all a. 
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1. Introduction 



1.1. Statement of results. As has been known since the time of Gromov's Non- 
squeezing Theorem, questions about symplectic embeddings lie at the heart of sym- 
plectic geometry. To date, most results have concerned the embeddings of balls or of 
products of balls since these are most amenable to analysis. (See Cieliebak, Hofer, 
Schlenk and Latschev [5] for a comprehensive survey of embedding problems.) How- 
ever, ellipsoids are another very natural class of examples. As pointed out by Hofer, the 
simplicity of the characteristic flow on their boundary makes them a natural test case 
for understanding the role of variational properties in symplectic geometry. One would 
like to understand the extent to which obstructions coming from periodic orbits cap- 
ture all symplectic invariants. Judging from the evidence of the current work, it seems 
one cannot take a naive approach. As pointed out in McDuff [15], the Ekeland-Hofer 
capacities of [6] (which are purely variational) do not give all obstructions. Instead one 
must use invariants coming from the Hutchings-Taubes [10] embedded contact homolo- 
gy, which has an unavoidably geometric flavor. Indeed Taubes [21] has recently shown 
it equals a version of Seiberg-Witten Floer homology and so is a gauge theory. 

In view of the work of Guth [7] on higher dimensional symplectic embedding ques- 
tions, there has been renewed interest in this kind of question. However, we restrict 
consideration to four dimensions since the methods and results in this case are very 
different from those in higher dimensions; cf. Remark 1.1.4. For relevant background 
and a survey of the results of the current paper see [16]. 

Given a real number a > 1 denote by E(l,a) the closed ellipsoid 
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This paper studies the function c: [l,oo) — > M. defined by 

(1.1) c(a) := inf |/i : £7(1, a) A 

where £*(/x) := |^ x i — A*} i s the ball of radius y/fi, and a4b means that A embeds 
symplectically in B. This is one of a range of symplectic capacity functions defined by 
Cieliebak, Hofer, Latschev and Schlenk in [5], and the first to be calculated. 

Since £7(1, a) has volume an 2 /2, we must have c(a) > yfa. Here is another elementary 
result. 

Lemma 1.1.1. The function c is nondecreasing and continuous. Further, it has the 
following scaling property: 

c(Xa) c(a) 

(1.2) -V^ < — when A > 1. 

Xa a 

Proof. The first statement is clear. The second holds because £7(1, Aa) C V~AE(l,a) 
when A > 1 and also £7(1, a ) A-B(fi) if and only if y/\E(l,a) A\/A5(/i) = B(Xfi). □ 

The function c(a) was calculated 1 in [15] for integral a as follows: 

(1.3) c(a) = yfa if a G N is 1, 4 or > 9, 

c(2)=c(3) = c(4) = 2, c (5)=c(6) = §, c(7) = f, c(8) = f . 

Its monotonicity and scaling property are then enough to determine all its values for 
a < 6: it is constant on the intervals [2, 4] and [5, 6] and otherwise linear, with graph 
along appropriate lines through the origin. (See Figure 1.1 and Corollaries 1.2.4 and 
1.2.8 below.) 

It turns out that the two steps of c(a) that we described above extend to an infinite 
stairs for a £ [1, r 4 ] where r 4 = Z±|^I is the fourth power of the golden ratio r := 1+ 2 V ^ . 
We call this Fibonacci stairs. Denote by g n := f2n-u n > 1, the terms in the odd places 
of the Fibonacci sequence /„. (For short, we call these the "odd Fibonacci numbers".) 
Thus the sequence g n starts with 1, 2, 5, 13, 34, .... Set go = 1 and for each n > 
define 

a n = ( g -fA 2 and b n = 9 -f±. 

Then 

a = 1 < fe = f = 2 < °i = (f ) 2 = 4 < ^i = f = 5 < 
a 2 = (|) 2 = 6i < 6 2 = f = 6i < a 3 = (f) 2 = 6i < 6 3 = f = 6| < .... 
More generally, 

• • • < a n < b n < a n +i < b n +i < ■ ■ ■ , and lima n = lim6 n = r 4 « 6.854. 



The first nontrivial result here, that c(4) = 2, was proved earlier by Opshtein in [20]. 
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Figure 1.1. The Fibonacci stairs: The graph of c(a) on [l,r 4 ]. 

Theorem 1.1.2. (i) For each n > 0, c(a) = ^= /or a 6 [a n , 6 n ], and c zs constant 
with value ^/a n +i on the interval [b n ,a n+ i\. 
(ii) c(a) = f on [r 4 ,7]. 

(hi) There are a finite number of closed disjoint intervals Ij C [7, 8^] such that 
c(a) = ^/a for all a > 7 ', a ^ ij. Moreover, c is piecewise linear in each Ij, with 
one non-smooth point in the interior of Ij . 

(iv) c(a) = y/a for a > 8^. 

The argument proving part (i) hinges on the existence of an unexpected relation 
between the function c(a) and the Fibonacci numbers. We shall see that this relation 
persists for a just larger than r 4 , and so we also deal with the interval [r 4 , 7] by largely 
arithmetic means. However, the analysis of c(a) for a > r 4 gets easier the larger a is. 
As we show in Corollary 1.2.4, it is almost trivial to see that c(a) = \fa when a > 9, 
and it is not much harder to see that c(a) = \Ja when a > 8^. The method used 
also shows that there are finitely many obstructions when a > 7. The full analysis of 
c(a) on [7, 8^] takes more effort. The intervals Ij contain rational numbers with small 
denominators; for example the three longest contain 7,7^, and 8, cf. Figure 1.2. 
We refer to Theorem 5.2.3 for a full description of c(a) on the interval [7,8^]. One 
point to note here is that although all the flatter portions of the graph of c are horizontal 
when a < r 4 , this is not true when a G [7, 8]; for example the two parts of the graph 
of c centered at a = 7i both have positive slope. 



THE EMBEDDING CAPACITY OF 4-DIMENSIONAL SYMPLECTIC ELLIPSOIDS 



5 



c(a) 




1 1 1 — ; — i — i 1 h — a 

r 4 7 7 i 7 i 7 i 7 i 8 

Figure 1.2. The graph of c(a) on [r 4 ,8^]. 

Connection with counting lattice points 

As we explain in §1.3 below, the obstructions to embeddings E(l, a) <^-> B(p) that we 
consider come from exceptional spheres in blow ups of CP 2 . Hofer 2 suggested that one 
should also be able to obtain a complete set of obstructions from the embedded contact 
homology theory recently developed by Hutchings and Taubes [10]. The embedded 
contact homology ECH* {E{a, &)) of a 4-dimensional ellipsoid has one generator in 
each even degree with action of the form ma + nb; m,n > 0. Since the action is a 
nondecreasing function of degree, the actions of the generators arranged in the order 
of increasing degree form the sequence N(a, b) obtained by arranging all numbers of 
the form ma + nb; m, n > 0, in nondecreasing order (with multiplicities). We will say 
that N(a,b) =4 N(a',b') if each term in N(a,b) is no greater than the corresponding 
term in N(a',b'). It is likely (though not yet fully proven) that ECH* is functorial, 

so that there is an embedding E(a,b) ^> E(a! ,b') only if there is an injective map 
ECH*(E(a,b)) — > ECH*(E{a! ,b')) that increases action. If this were true, then the 
sequence N(a,b) would be a monotone invariant of E(a,b). Thus, Hofer's suggestion 
is that 

(C) E(a,b)AE(a',b') ^ N(a,b) 4 N(a' ',&'). 

One might be able to prove this directly by showing that the embedded curves that 
provide the morphism ECH*(E(a,b)) — > ECH*(E(a' ,&')) correspond precisely to the 
exceptional spheres that give our obstructions. We will take a more indirect approach. 
For a > 1 define 

CECH(a) := inf {// > | 7V(1, a) 4 N(ja, fi)} . 
Then Conjecture (C) for the case that the target ellipsoid is a ball becomes 
(1.4) cech{o) = c(a) for all a > 1. 



Private communication. 
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The expected functorial properties of embedded contact homology should imply that 
cech{o) < c(a) for all a; see Hutchings-Taubes [11], as well as Remark 2.4.5 (ii). In 
this paper we prove the converse. 

Theorem 1.1.3. cech(cl) > c(a) for all a > 1. 

Thus, in the end, we should have cech{o) = c(a). 

Remark 1.1.4. (i) The methods used to analyze the embedding of a 4-dimensional 
ellipsoid into a ball work equally well when one considers embeddings from one ellipsoid 
to another. In other words, Theorem 1.2.2 below has an analog that is applicable to 
this setting; see [15, Theorem 1.5]. One can also use much the same method to analyze 
embeddings of an ellipsoid into S 2 x S 2 ; see [18]. 

(ii) One might wonder if these results can be extended to higher dimensions. For 
example, in dimension 6 is there is a symplectic embedding P(a, fr, c) E{a! ,b' , c') 
if and only if N(a,b,c) ^ N(a' ,b' ,c')7 Guth's construction in [7] of an embedding 
P(l, R, R) A P(2, 10, 2R 2 ) for all R > 1 shows that the answer is no. It is not at 
present clear what the correct condition should be in this case. (See Hind-Kerman [9] 
for a more precise version of Guth's result.) Note that embedded contact homology 
is a specifically 4-dimensional theory, as are the results stated in Theorem 1.2.2 and 
Proposition 1.2.12 below on which our calculation of c is based. 

(iii) There are two early papers by Biran with constructions that are somewhat 
similar to ours. In [3] he uses an iterated ball packing construction to obtain information 
on the Kahler cone of blow ups of CP 2 . Continued fractions are relevant here, but 
Biran does not use them in the way we do. The survey article [4] mentions how an 
understanding of embeddings of ellipsoids might help calculate this Kahler cone, and 
hence suggests a potential application of our work. However, for this one would need 
to understand which embeddings of ellipsoids give rise to Kahler forms, a question that 
we do not consider. O 

1.2. Method of proof. The first author showed in [15] that if a > 1 is rational there 
is a finite sequence w(a) := (w±, . . . ,wm) of rational numbers such that the ellipsoid 
£7(1, a) embeds symplectically in the ball £>(//) exactly if the corresponding collection 
UiB(wi) of M disjoint balls embeds symplectically in P(/t/). This ball embedding 
problem was reduced in McDuff-Polterovich [17] to the question of understanding the 
symplectic cone of the M-fold blow up of CP 2 . After further work by Biran [2] and 
McDuff [14], the structure of this cone was finally elucidated in Li-Liu [13] and Li- 
Li [12]. 

The key to understanding this cone is the following set Em- 

Definition 1.2.1. Denote by Xm the M-fold blow up of CP 2 with any symplectic struc- 
ture ojm obtained by blow-up from the standard structure on CP 2 . Let L,Ei, . . . , Em G 
Pl2{Xm) be the homology classes of the line and the M exceptional divisors. We de- 
fine £m to be the set consisting of (0; —1, 0, . . . , 0) and of all tuples (d; m) of nonneg- 
ative integers (d; mi, . . . , tum) with m\ > ■ ■ ■ > tum and such that the class P^m) := 
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dL — '^2 i miEi is represented in (Xm,^m) by a symplectically embedded sphere of self- 
intersection — 1. // there is no danger of confusion, we will write £ instead of Em- 
Clearly, Em C Em 1 whenever M < M' . 

Since these classes Er d . m \, (d;m) G Em, have nontrivial Gromov invariant, they 
have symplectically embedded representatives for all choices of the blow-up form u>m- 
Therefore the above definition does not depend on this choice. 

Denote by —K := 3L — Ei the standard anti-canonical divisor in Xm, and consider 
the corresponding symplectic cone Ck, consisting of all classes on Xm that may be 
represented by a symplectic form with first Chern class Poincare dual to —K. Then 
Li-Li show in [12] that 

C K = {a£ H 2 (X M ) | a 2 > 0, a(E) > for all E G E M ) ■ 

Proposition 1.2.12 below gives necessary and sufficient conditions for an element (d; m) 
to belong to Em- Before discussing this, we explain the relevance of Em to our problem. 
The following result is proved in [15]. We will denote by i, e« G H 2 {Xm) the Poincare 
duals to L, Ei and by m • w = YldLi m i w i the Euclidean scalar product in R M . 

Theorem 1.2.2. For each rational a > 1 there is a finite weight expansion w(a) = 
(wi, . . . , wm) such that E(l,a) embeds symplectically in the interior of B 4 (fi) if and 
only if fj,£ — ^ w^ei G Ck- Moreover Wi < 1 for all i and ^ wf = a. 

Corollary 1.2.3. If the rational number a > 1 has weight expansion w(a) = w = (wi), 
then 

c(a) = sup^v 7 ^) fi(d;m)(a) \ (d;m) G E^j , 
where /x(d;m)(a) := m ™^ . 

Proof. The above description of Ck shows that E(l,a) embeds into the interior of 
B 4 ([i) if and only if the tuple (l~i,w) satisfies the conditions 

(i) fi 2 > w ■ w =: £>f, 

(ii) d/j, > m ■ w =: miWi for all (d; m) G Em- 

The corollary now follows because w ■ w = a. □ 

Biran showed in [2] that /j,(d;m)(k) < \fk for all (d;m) G £ for all integers k > 9. 
His argument extends to all a > 9 and shows: 

Corollary 1.2.4. c(a) = ^fa when a > 9. 

Proof. Since c is continuous by Lemma 1.1.1, it suffices to check this for rational a. Fix 
(d;m) G £. The corresponding symplectically embedded (— l)-sphere E has c\{E) = 1, 
and so 3d — 1 = J2i m i- Therefore, J2i m i w i — J2i m i = 3d — 1. For a > 9 we thus find 

fi(d;m)(a) := ^ < 3 < Ja~. 

Now use Corollary 1.2.3. □ 
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In view of Corollary 1.2.3, our task is two-fold; first to understand the weight expan- 
sions and then to understand the restrictions placed on embeddings by the elements 
of Em- The description that we now give for to (a) is convenient for calculations but is 
somewhat different from that in [15]. The equivalence of the two definitions is estab- 
lished in Corollary A. 10. 

Definition 1.2.5. Let a = p/q £ Q written in lowest terms. The weight expansion 

w := (wi) := (u>i, . . . ,wm) of a > 1 is defined recursively as follows: 

• w\ = 1, and w n > w n+ \ > for all n; 

• if Wi > Wi + \ = • • • = w n (where we set wo := a), then 

if m+i H h w n +i = (n-i + l)w i+1 < Wi 

(n — i)wi + \ otherwise; 

• the sequence stops at w n if the above formula gives w n+ \ = 0. 
The number M of entries in w(a) is called the length 1(a) of a. 

For example, a = 25/9 has weight expansion w(a) = (1, 1, |, §,§,§, g, g)> which we 
will abbreviate as (l x2 ,f,§ x3 ,± x2 ). 

We may also think of this expansion (wi) as consisting of iV + 1 blocks of length £ s 
of the (decreasing) numbers x s where xq = 1; viz: 

(1.5) w(a) := (l, . .. , 1, xi, ...,x 1; x N , . .. , x N ) 

e h £ N 

— ^1 , Xj , . . . , Xpj j . 

Then x\ = a — £q < 1, X2 = 1 — £ix\ < x±, and so on. In this form the sequence can 
be generated as follows. If a = 2, first draw a rectangle of length p and height q, then 
mark off as many (say £q) squares of side length q as possible, then in the remaining 
rectangle of size q x (p — £^q) mark off as many (say £±) squares of side length (p — £$q) 
as possible, continuing in this way until the rectangle is completely filled. Then qxj 
is the side length of the (j + l)st set of squares, while the £j are the multiplicities: 
see Figure 1.3. As is well known, the multiplicities £j,0 < j < N, give the continued 
fraction expansion [£q\ £±, . . . , £jy] of p/q. For example, 25/9 = [2; 1, 3, 2] and 

25 1 

= 2 + 



9 1 + ^t 



Notice also that 25/9 = 2 • l 2 + (7/9) 2 + 3(2/9) 2 + 2(l/9) 2 . 
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FIGURE 1.3. The expansion for a = 25/9. 



Lemma 1.2.6. Let w 


= {wi,... 


, wm) be the weight expansion 
WM = \, 


(1.6) 


w ■ w 


M 

i=i 


(1.7) 




M 

i=i 



Proof. Equation (1.6) holds because the total area of all the squares is pq. To under- 
stand the sum, suppose that there are N + 1 sets of squares in the expansion (1.5) so 
that xn+i = and write 

J]u;i = 1+ f l + - - - + l +xi) + ( xi + • • • +xi +x 2 ) +■■■ 
i «o-l ^"^^ 

+ ( ^Af + - ; / + ^JV +^Af+l) 

= 1 + (a — 1) + (1 — xi) H h (x7v_! - xat) 

= 1 + a — xat. 

It remains to note that xn = wm = V<7- This is obvious from the geometric construc- 
tion. For, gxjv = ?wm is the side length of the smallest square in the decomposition of 
the rectangle. If this length were divisible by s, then the side lengths of all the squares 
would be divisible by s. Hence both p and q would be divisible by s. But they are 
mutually prime by hypothesis. □ 

We next describe the sets £m- The first lemma is well known, and can be easily 
deduced from Proposition 1.2.12 below. 

Lemma 1.2.7. The set £m is finite for M < 8 with elements (d; mi, . . . , thm) equal 
to: 

(0;-l), (1;1,1), (2;l x5 ), (3;2,l x6 ), 
(4;2 x3 ,l x5 ), (5;2 X6 ,1,1), (6;3,2 x7 ). 
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Prom this one can immediately calculate c(a) for those a whose weight expansion 
has k < 8. 



Corollary 1.2.8. The function c takes the following values: 

2' 



c(2) = c(3) = c(4) = 2, c(5) = c(6) - 5 



c(f)=f, c(7) = |, c(8) = f. 

Moreover, its graph is linear on each subinterval [1,2], [2,4], [4,5], [5,6]. 

Proof. The values of c(a) for integers a G [1,8] were calculated in [15, Cor 1.2]. One 
can similarly calculate c(^) since the length of «?(^) is < 9. The second statement 
then follows from Lemma 1.1.1. □ 

From Lemma 1.2.7 we can also compute c near 7. 
Proposition 1.2.9. For a G [6^,7] we have c(a) = j(a + 1). Also c(a) = | for 

Proof. Since c is continuous, it suffices to prove these identities for a G Q. First assume 
that a < 7 and write a = 6 + x. Then 

u?(a) = (l x6 ,x,tt; 8 , . . . , w M ) , 

where < Wi < 1 — x for i > 8. The element (3; 2, l x6 ) G £7 gives the constraint 

c(a) > /Uo = 5(0 + !)■ 

Since 1 — x < x/9, at least the first 9 of the weights ws,wg, . . . are equal. Hence, by 
Corollary 1.2.4, we can fully pack all but the first 7 balls into one ball of size A where 
a = 6 + x 2 + A 2 . It remains to show that the 8 balls of sizes 

W = (l,...,l,x,X) 

fit into B((io), that is, W ■ m < jj(7 + x) for all (d; m) G £ . 

This is clear for classes in £ 7 . The classes in £$^£7 are (4;2 x3 ,l x5 ), (5;2 x6 ,l,l), 
(6; 3, 2 x7 ). The strongest constraint comes from (5; 2 x6 , 1, 1) and equals 

Ati = i(l2 + x + A). 

The desired inequality fii < hq is equivalent to 1 + 3A < 2x. Since A 2 = x(l — x) we 
need 13x 2 — 13x + 1 > 0, which is satisfied when x > ^. 

We know c(7) = | by Corollary 1.2.8. Therefore it suffices to show that c(7|) = |. 
As above, since the nine balls B(^) fully fill B(^), we just need to check that the finite 
number of elements in £% give no obstruction to embedding 8 balls, seven of size 1 and 
one of size |, into -B(f). □ 

Remark 1.2.10. Similarly, there is an obstruction at a = 8 given by the class (d; m) = 
(6; 3, 2 x7 ). For a < 8 with w(a) = (l x7 , a— 7, ... ) this gives the constraint //(a) = ^^p, 
while for a > 8 we get fj,(a) = Therefore c(a) > /u(a) > for 8+ 3\/7 < a < ^1 
However, unfortunately, one cannot argue as above to show that c = /ton some interval 
(8 — e, 8jjg] because the auxiliary packings would involve 9 balls and £g is infinite. We 
shall prove that c = pon j" 8+3\/7 ^ g^j - n Sections 5.2 and 5.3 by different methods. O 
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Now consider Em, M > 9. We say that a tuple of integers (d; m) = (d; m±, . . . , uim) 
is ordered if m; > m^+i when m; / 0, rrij+i / 0, and if the rrii with = are at 
the end. For instance, the elements of Em are ordered in view of Definition 1.2.1. To 
characterize Em when M > 9 we need the following definition. 

Definition 1.2.11. The Cremona transform of an ordered tuple (d;m) is 

(2d — mi — mi — 7773; d — mi — m%, d — m\ — m$, d — m\ — 7772, m±, 777.5, • • • ) • 

A standard Cremona move Cr takes an ordered tuple (d; m) to the tuple obtained 
by ordering the Cremona transform of (d;m). More generally, a Cremona move is 
the composite of a Cremona transform with any permutation of m. 

Standard Cremona moves preserve Em because they are achieved by Cremona trans- 
formations, which (modulo permutations of the Ei) are just reflections A 1— > A+(A-C) C 
in the (— 2)-sphere in the class C := L — E\ — E2 — £3; cf. [12]. 3 In particular these 
moves preserve the intersection product and the first Chern class ci(M) := 3L — J2i Ei- 

Proposition 1.2.12. (i) The following identities hold for all (d;m) 6 Em- 

(1.8) rrii = 3d — 1, m • m := Tn| = d 2 + 1. 

i i 

(ii) For all pairs (d;m),(d';m') of distinct elements of Em we have 

m ■ m! := ^ m^ m[ < dd' . 

i 

(iii) A tuple (d;m) satisfying the Diophantine conditions in Equation (1.8) belongs 
to Em exactly if it may be reduced to (0; —1, 0, ... , 0) by repeated standard Cre- 
mona moves. 

Proof. The two equations in (i) express the fact that any symplectically embedded (— 1)- 
sphere E has c\(E) = 1 and E ■ E = —1. Since the elements in Em are all represented 
by embedded J-holomorphic spheres for generic J, part (ii) holds by positivity of 
intersections. 

Part (iii) for a class (d; m) with d = is clear. Part (iii) for non-negative tuples 
(d;m) may be deduced from Li-Li's arguments in [12, Lemma 3.4]. In this paper the 
authors work in a more general context than ours, considering all symplectic forms 
on Xm, while we consider only those symplectic forms with the standard first Chern 
class (or anticanonical class) — K := 3L — ^ Ei. They also introduce many ideas, such 
as the symplectic genus. However, the concept relevant here is that of a reduced class. 
This is a class A := dL — m iEi with 

d > 0, 777i > m 2 > • • • > 0, d > 777i + m 2 + m 3- 

We can clearly assume that M > 3. In this case they show that for every class A 
with A 2 = — 1 and c\(A) > there is a combination of Cremona moves and reflections 



If to is a symplectic form on Xm for which the class C is represented by a Lagrangian sphere Sl, 
then the Cremona transformation can be realized by the Dehn twist in Sl; cf. Seidel [19]. 
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Ei i y —Ei that transform A either into E\ (which corresponds to (0; — 1, 0, . . . , 0)) 
or into a reduced class A'. Each step consists of a Cremona transform, followed by 
an adjustment of signs to make the coefficients of the —Ei non- negative and then a 
permutation to reorder the m^. Since the Cremona transform takes (d; m) to 

{2d — mi — 771,2 — ^3! d — 777,2 — 1^3, d — mi — "7-3> d — mi — 777,2, 1^4, • • • ) 

this decreases d unless A is reduced. Li-Li show by a simple algebraic computation 
that, because we start with a class with A- A > — 1 and the reduction process preserves 
the intersection form, the coefficient d cannot become negative. Thus one stops the 
process when the coefficient d is at its minimum. If d = then the final class Aq 
is (0; — 1, . . . , 0). On the other hand, if Aq is reduced, another essentially algebraic 
argument (part 3 of their Lemma 3.4) shows that Aq • E > for all solutions E to the 
equations (1.8). Hence Aq £ £m since Aq ■ Aq = — 1. 

It is easy to adapt the results of this lemma to our situation. We are interested here 
only in symplectic forms with the standard canonical class K, and therefore cannot 
change the signs of the E^. However, if there is a sequence a of standard Cremona 
moves that takes a tuple (d; m) that satisfies (1.8) to a tuple (d'; m') with some 777^ < 0, 
then either d' = and (d';m') = (0; —1,0, . . . , 0), or d' 7^ 0. But in the latter case 
(d; m) cannot be in £m since we would have 



where E is the image of Ei under the reverse sequence a of Cremona moves. There- 



Definition 1.2.13. A class E = (d;m) G £ is called obstructive if u(d;m)(z) > 
^fz on some nonempty interval I. Further we say that E is obstructive at a if 
/j,(d; m)(a) > ^/a. 

Thus our task is to understand enough about the obstructive classes to figure out 
the supremum of the corresponding constraint functions fj,(d;m). 

Remark 1.2.14. (i) Later we will expend considerable effort to show that certain 
classes E = (d;m) that satisfy the identities (1.8) do in fact lie in £m- In some cases, 
the corresponding constraints fi(d;m)(a) contribute to c(a). However, in many other 
cases (for example the classes E(bk(i)} of Proposition 4.2.2 for i > 3, see Lemma 4.3.1) 
the constraint a(d; m)(a) does not contribute to c(o); rather E influences c(a) because 
E ■ E' > for all E' £ Sm^E. For this positivity of intersections to hold, it is not 
necessary that E £ Em- As explained in the proof of Proposition 1.2.12, it suffices that 
when we apply standard Cremona moves to E we do not arrive at a class with d > 
and some 77ij < 0, but instead end up at a reduced class. (By [12, Lemma 3.6], the 
class E then must have positive symplectic genus, and therefore cannot be represented 
by a smoothly embedded sphere.) However, it is just as difficult to check this condition 
as it is to check whether E G £m, and, in fact, it turns out that E G £m in all cases of 
interest to us here. 



E(d;m) ■ E — E( d /. m /) 



Ei = m'i < 0, 



fore (iii) must hold. 



□ 
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(ii) As we show in Proposition 5.2.1, there are only finitely many tuples (d; m) £ £m 
that are obstructive at some a > 7. In fact, there are precisely 13 such classes; the 
class (3; 2, l x6 ) centered at a = 7, another 8 classes that contribute to c(a) as described 
in Theorem 5.2.3, and 4 more classes listed in Lemma 5.2.5, that are "hidden" in the 
sense that they contribute nothing new to c(a). Although we work mostly by hand, we 
do use the computer programs of Appendix B to prove Corollary 5.2.10, which states 
that there are no other relevant classes. 

In contrast, there are infinitely many classes that are obstructive somewhere on the 
interval [1,7], and we do not try to compute them all. As shown in Example 2.3.1, 
the part of the graph of an obstruction fj,(d;m)(a) that lies above ^/a can be quite 
complicated and need not have the scaling or positivity properties of c that are described 
in Lemma 1.1.1. Further, even though Corollary 2.1.4 states that at each point a where 
c(a) > yfa there are only finitely many obstructive classes with /j,(d;m)(a) = c(a), we 
do not know if for some ao there are infinitely many classes with /x(d;m)(ao) > y/a . 
By Remark 5.2.2 this cannot happen when ao > r 4 . When ao = r 4 , Proposition 4.3.2 
shows that there are infinitely many classes that are obstructive on an interval whose 
closure contains ao- However, because c(r 4 ) = r 2 no class is obstructive at r 4 itself. 
We have no relevant results when ao < r 4 . 

(iii) We compute c(a) for a < 7 by looking not only at classes with fj,(d;m)(a) > yfa 
but also at some other classes that influence c(a) indirectly. The most interesting of 
these are the classes described in Proposition 4.3.2 that are made from the even terms 
of the Fibonacci sequence. They play a dual role. Though obstructive, they contribute 
nothing new to c(a) and so the corresponding graph is called the ghost stairs. Their 
importance is rather that they allow one to calculate c(a) at a series of points e& where 
they are not obstructive; cf. the proof of Corollary 4.2.4. O 

1.3. Outline of paper. Corollary 1.2.3 gives a formula for c(a) that we can interpret 
using the description of £ contained in Proposition 1.2.12. However, this formula is 
not at all explicit, and the methods needed to understand it depend on the size of a. 
One can compute c(a) by direct methods when a < r 4 , but for larger a our arguments 
require a deeper understanding of the constraint functions fj,(d;m)(a). Therefore we 
begin in Section 2 by developing some tools to distinguish the obstructive classes in £. 

First, we show in Proposition 2.1.1 that (d; m) is obstructive at a only if the vector m 
is almost parallel to w(a). If m is parallel to w(a) for some a, then we call (d;m) 
a perfect obstruction at a. Lemma 2.1.5 shows that these elements determine the 
function c(z) for z near a, while Corollary 3.1.3 shows that the only perfect obstructions 
occur at the numbers b n of the Fibonacci stairs. Second, we show in Lemma 2.1.3 that if 
[i(d; m)(a) > \fa on the interval /, then / has a unique central point ao distinguished 
by the fact that £(ao) = £(m) while £(a) > £{m) for all other a 6 I. (Here, £(m) 
denotes the number of positive entries in m.) Lemmas 2.1.7 and 2.1.8 describe other 
useful properties of obstructive classes. 

These are the basic results needed to determine c(a) when a > 7. (Since by Proposi- 
tion 5.2.1 there are only finitely many obstructive (d; m) for a > 7, we can analyze these 
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on a case by case basis, without using more general results.) However, we continue in 
Section 2 with a deeper analysis of the functions n(d;m), so that we can explain the 
relation of c(a) to the lattice point counting problem. This analysis is based on Propo- 
sition 2.2.6 which derives surprising identities satisfied by weight expansions. (These 
are quadratic identities involving the weight expansions of a and its "mirror" a.) This 
proposition also turns out to be helpful in understanding c(a) on [r 4 ,7], where there 
are infinitely many obstructive classes. 

Our next main result is Proposition 2.3.2 which shows that the central point of I 
is the break point of fi(d;m) in the sense that /j,(d;m) is linear on each component 
of 7\{ao}. Moreover, one can apply Proposition 2.2.6 to show that the coefficients of 
these linear functions are remarkably close to those of the linear functions that occur 
in the counting problem. In §2.4, we explain this connection and prove Theorem 1.1.3 
(which states that cech > c). 

In Section 3 we calculate c(a) for a € [1, r 4 ] by direct methods. Theorem 3.1.1 states 
that there are classes E(a n ) and E(b n ) in £ given by tuples (d; m) constructed from 
the weight expansions of ratios of odd Fibonacci numbers. As we see in Corollary 3.1.2, 
because these classes are perfect their very existence together with the scaling property 
of c is enough to calculate c(a) in this range. The difficulty here is to prove that these 
classes really do belong to £ . In particular, describing what happens to these classes 
under Cremona moves involves establishing many quadratic identities for Fibonacci 
numbers. Therefore in Section 3.2 we develop an inductive method to prove such 
identities; cf. Proposition 3.2.3. The proof of Theorem 3.1.1 is completed in §3.3. This 
section is essentially independent of Section 2. 

We next compute c(a) on [r 4 , 7]. The obstruction (3; 2, l x6 ) centered at 7 gives the 
lower bound c(a) > on this interval, and our task in Section 4 is to show that no 
obstruction exceeds this one. One difficulty is that the quantity y(a) := a + 1 — Sy/a 
tends to as a approaches r 4 , permitting the existence of infinitely many obstructive 
classes; cf. Proposition 4.3.2. Another is that the line ^p- does not pass through the 
origin. Therefore we can no longer use the scaling property of c, which in the case of 
the interval [1, r 4 ] allowed us to restrict attention to the points a n , b n . Nevertheless, by 
using the results of Section 2 we show in Proposition 4.1.6 that there is only a double 
sequence of relevant points. 

One could then attempt a direct calculation of c(a) at these points, combining more 
elaborate versions of the estimation techniques used in Section 5 with arithmetic results 
based on Corollary 2.2.7. This is possible. However it is very complicated and it turns 
out that there is a much easier proof. The sequence E(b n ) of perfect classes that de- 
termine the Fibonacci stairs really consists of two subsequences Ek(0) and £^(1) that 
are the first two members of an infinite family E^(i), i > 0, of sequences of "nearly 
perfect" classes in £ . The classes Ek(2), k > 1, form the ghost stairs discussed in 
Remark 1.2.14(iii), while the classes Ef,(i), i > 2, are not obstructive by Lemma 4.3.1. 
Nevertheless, as we show in Lemma 4.1.10, the fact that they are nearly perfect puts 
constraints on the possible obstructive classes. The desired conclusion follows by com- 
bining this result with Proposition 4.1.6. 
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Section 5 carries out a detailed analysis of the obstructions in the interval [7, 9] . The 
argument is based on the equality in Proposition 2.1.1 (iv). This estimates the "error" 
(the difference between m and a suitable multiple of w(a)) at a rational point a = p/q 
in terms of the quantity y(a) — l/q, where again y{a) = a+l—3y/a. Since y(r 4 ) = 0, this 
estimate gets better the further a is from r 4 and the larger q is. One easy consequence 
is Proposition 5.2.1, stating that there are only finitely many obstructive classes (d; m) 
for a > 7. The proof (in §5.2) that c(a) = y/a for a > 8^ is also easy. 

To work out exactly what the constraints are requires some computation. It would 
be possible, though very tedious, to do this entirely by hand. We have aimed to 
use the computer as little as possible, and so have developed quite a few techniques 
for estimating the error. Since y{a) — | is negative when a = 7^, we must treat 
these points separately, by purely arithmetic means. Thus in this case we simply look 
for suitable solutions (d;m) of the Diophantine equations (1.8) with centers at these 
points, using Lemma 2.1.7 to limit possibilities. This computation (in Lemma 5.2.5) 
finds several classes that contribute to c(a) as well as some interesting "hidden" classes 
for which fi(d;m)(a) = c(a) at just one point, namely the center. There are some 
other obstructive classes centered at points of the form a = 7 2 k+i • (The table in 
Theorem 5.2.3 lists all classes that contribute to c(a).) We show that there are no other 
obstructive classes in §5.2 using estimates developed in §5.1 as well as two computer 
programs that are described in Appendix B. 

Finally, Appendix A explains the connection between our current definition of the 
weight expansion of a in terms of the continued fraction expansion of a and the defi- 
nition used in [15], which came from a blow up construction. The results here are no 
doubt well known; we included them for the sake of completeness. 

Acknowledgments. We wish to thank Dylan Thurston for making some very helpful 
observations at the beginning of this project (he was the first person to point out a 
connection with Fibonacci numbers); Michael Hutchings for explaining the obstructions 
coming from embedded contact homology; and Helmut Hofer and Peter Sarnak for their 
inspiration and encouragement. 



2. Foundations 

2.1. Basic observations. Given a with weight expansion w(a) of length i{a) = M 
and (d; m) G £m, we define e := e(a) = (e±, . . . , em) by setting 

(2.1) m = -^=w(a) + e. 

We will refer to the vector e as the error, and to quantities such as as the 

squared error. We need to understand the function fj,(d;m)(a) = m • w(a)/d defined 
in Corollary 1.2.3. 

Our arguments will be based on the following observations. 

Proposition 2.1.1. For all (d;m) G £ and a, we have 
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(i) n(d;m) := fi(d;m)(a) < \J~a~\J\ + 1/d 2 ; 

(ii) fi(d; m) > yja <^=^ e ■ w > 0; 

(iii) // n(d; m) > y/a, then E := s ■ e = ^sf <\; 

(iv) Let y(a) := a + 1 — 3y/a where a = p/q. Then 

(2-2) -£* = ! + "J)- 

Proof. Lemma 1.2.6 and Proposition 1.2.12 imply that 

)ti(cZ; m) d = t« • m < \\w\\ \\m\\ = \fa\J d 2 + 1. 
This proves (i). (ii) is immediate, while (iii) follows from (ii) because 

d 2 + 1 = m • m = (^w(a) + e) ■ (^w(a) + e) = d 2 + 2^w(a) -e + e-e. 
Finally, to prove (iv) observe that 

3d-l = Em = ^(a + l-i)+E^- 

Hence 

^(a + l-3^)-^ + l + E^ = 0. 
This completes the proof. □ 

Remark 2.1.2. (i) Proposition 2.1.1 (iii) implies that an element (d;m) 6 £ gives an 
obstruction at a (i.e. has fj,(d;m)(a) > \/a) only if the vector m is "almost parallel" to 
the vector w(a). In particular, if we are interested in solutions that provide obstructions 
when a = k + x for x € (0, 1) we need the first k entries to be equal within the allowable 
error. As we will see in Lemma 2.1.7 below, this means that the first k entries of m 
must lie in the set {mi, mi — 1}, with at most one entry different from the others. Some 
elements of £ with d < 9 that satisfy these conditions for a £ [6, 8] are 

(2;l x5 ), (3;2,l x6 ), (5;2 x6 ,l x2 ), (8;3 x7 ,l x2 ). 

It turns out that these elements all do give obstructions. 

(ii) Another noteworthy point is that y{a) = when a = r 4 . Therefore (iv) gives 
most information when a — r 4 is quite large, e.g. if a > 7; see §5.1. O 

The next result explains the basic structure of the constraints. Throughout we write 
i{m) for the number of positive entries in m, and £(a) for the length of the weight 
sequence w(a). 

Lemma 2.1.3. Let (d;m) G £, and suppose that I is a maximal nonempty open 
interval such that \fa < fi(d;m)(a) for all a £ L. Then there is a unique ao £ L such 
that £(ao) = £(m). Moreover 1(a) > i(m) for all a £ L. 
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Proof. Denote by Wi(a) the ith weight of a considered as a function of a. Then it is 
piecewise linear, and is linear on any open interval that does not contain an element a' 
with length £(a') < i. That is, the formula 4 for W{(a) can change only if it or one of 
the earlier weights becomes zero. 

Therefore if £(a) > £(m) for all a £ I, the function fi(d;m)(a) is linear in /. But 
this is impossible since the function y/a is concave and I C (1, 9) is bounded. 

Thus there is ao € / with £(ao) < £(m). On the other hand, if 1(a) < £(m), then 
J2i<e(a) m? <d 2 + l, so that 



\w ■ m\ < \\w\\ / ^2 m i — d\\w\\ = d\fa. 

y *<^(a) 

Hence fi(d;m)(a) < y/a, i.e. a £ I. 

The uniqueness follows from the properties of continued fractions. We claim that if 
b > a and £(b) = £(a) then there must be some number y G (a, b) with £(y) < £(a). 
Since such y cannot be in /, this gives the required uniqueness. To prove the claim, 
let a have continued fraction expansion [£q; £±, . . . , £^\ and consider the functions Xj(a) 
as in equation (1.5). If N is even, the function xjy(z) decreases as z increases. Hence 
£]\j increases and so the next number z > a with length < £(a) is [£q\ £\, . . . , £n-i] which 
has length < £(a). Similarly, if we look for numbers z < a with £(z) < £(a), then the 
first one is [£q; . . . ,£n — 1]. Similar arguments apply if N is odd. □ 

Corollary 2.1.4. Suppose that c(a) > y/a. Then 

(i) There are (possibly equal) elements (d^^m* 1 ) G £ and e > such that 



c(z) 



n(d ;m )(z) for all z£(a — e,a], 
ji(d + ;m + )(z) for all z€[a,a + e). 



(ii) On each of the intervals in (i) there are rational numbers a,/3 > such that 
c(a) = a + j3a. 

(hi) The set of (d;m) such that c(a) = /j,(d;m)(a) is finite. 



Proof. Since c(a) > %/a, there exists DeN with \Jl + 1/D 2 < c(a) / \fa. If (d; m) € £ 
is such that fi(d;m)(a) = c(a) > \/a, then d < D by Proposition 2.1.1 (i). But there 
are only finitely many elements (d;m) G £ with d < D. Since c(a) is continuous by 
Lemma 1.1.1, we must have \fl + 1/D 2 < c(z)j\fz for all z sufficiently close to a. 
Further, as we have seen in the proof of Lemma 2.1.3, each function fi(d;m)(z) is 
piecewise linear, and it has rational coefficients because the weight functions Wi(z) 
do. Hence, c(z) is the supremum of a finite number of rational linear functions. This 
proves (i) and (hi). Moreover, if near a we write c(z) = a + (3z for some rational 
numbers a, j3, then f3 > since c is nondecreasing, while a > because of the scaling 
property in equation (1.2). □ 



See Lemma 2.2.1 for an explicit expression. 
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Let us call an element (d; m) G £ perfect if m is a multiple of the weight vector 
w(b) of some b > 1. The next lemma combined with Corollary 2.1.4 shows that these 
elements determine c(a) for a near b. 

Lemma 2.1.5. Suppose that (d;m) G £ is perfect: m = Kw(b) for some b > 1. Then 

(i) fi(d;m)(b) = c(b) > \fb, and (d;m) is the only class with fi(d;m)(b) = c(b). 

(ii) m = qw{b) where b = p/q in lowest terms, and b < r 4 . 

Proof (i) Since (d; m) G £ and m = Kw(b), we have d 2 < d 2 + 1 = m ■ m = 
K 2 w(b) ■ w(b) = K 2 b, whence d < K\fb. Therefore, 

I J MVS m-w(b) Kb Kb rr 

,(d;m)(b) := — ^ = - > = Vb. 

Let (d'; m') £ £ be another solution. Since (d'; m!) / (d; m), positivity of intersections 
(part (ii) of Proposition 1.2.12) shows that dd' > m ■ ml = Km' ■ w(b). This and 
d 2 < d 2 + 1 = m ■ m = Km ■ w(b) yield 

/ ,/ /w,n m' ■ w(b) d m-w(b) , , , .,, 
H(d!;m')(b) = j±± <-< j^- = »(d;m)(b). 

(ii) Let w(b) = (wi, . . . , u>m)- Since wm = | and tum £ ^, we have k = sq for some 
integer s. Equations (1.8) and Lemma 1.2.6 give 

3d -I = y^m^ = K,y~]wj = sq(b+l — ^), 

d 2 + l = J> 2 = K 2 ^ = (sq) 2 b. 

If s = 1, then 3d = g(6 + 1) so that 

1 + b - 3Vb = 3^ - 3^±i~ < o. 

Thus Vb < t 2 . 

Otherwise, adding the two above equations gives s\d(d + 3). But the first equation 
shows that s, d are mutually prime. Therefore s|d+3 and s\3d— 1; hence s| 10. Therefore 
s = 2, 5 or 10. 

The identity (d + 3) 2 = (d 2 + 1) + 2(3d - 1) + 10 shows that s 2 \2(3d - 1) + 10. If 2\s 
this means that d is even which is impossible since 3d — 1 is even. 
If s = 5 then 3d + 4 = 5^(6 + 1) so that 

(2.3) y(b) := 1 + b - 3\fb = ^ (3d + 4 - 3\/d 2 + 1) > 

Thus > t 2 . But this is impossible: For (d;m) G £ must have nonnegative intersec- 
tion with the class (3; 2, l x6 ) G £. Therefore 

2777-1 + 771,2 + • • • + 777-7 < 3d. 

If b G [r 4 , 7], then m = 5(/(l x6 , 6 — 6, ... ), and we obtain 

5q(l + b) < 3d, 
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a contradiction. Assume now that b = 1'- > 7. Assume first that b = 7|. Then 
3d — 1 = 40(7 an d d 2 + 1 = 25q 2 (7 + |). Solving the first equation for d and inserting 
the result into the second equation, we get the equation 

5q 2 -29q + 2 = 

whose solutions are not integral. Thus b = 7 r - with r > 2. By (2.3), 

_ 3d + 4 - 3V0WT1 4 4 
Q ~ 5yjb) < %(&) < %(7) < ' 

whence g < 12. Thus 6 > 7^, and so g < r ^ < 6, whence q < 5. Thus b > 7|, and 
so g < 5 ^^, 2 ^ < 4, whence g < 3. Thus 6 > 7|, and so q < 2 ^ < 3, whence q < 2. 

Thus b > 8, and so g < < 2, whence </ = 1. Thus b > 9, and so g < < 1, 
which is impossible. □ 

Remark 2.1.6. We show later that the only perfect elements are those at the num- 
bers b n of the Fibonacci stairs; see Corollary 3.1.3. However there are many nearly 
perfect elements that are relevant to the problem such as the classes E{a n ) of The- 
orem 3.1.1 and the classes E(b k (i)} of Proposition 4.2.2. These elements are perfect 
except for some adjustments on the last block. O 

The next lemma expands on the first part of Remark 2.1.2. 

Lemma 2.1.7. Assume that (d; m) £ £ is such that (i(d;m)(a) > \J~a~. Let J := 

{k, ... ,k + s — 1} be a block of s > 2 consecutive integers for which w{ai), i £ J, is 
constant. Then 

(i) One of the following holds: 

m k = ... = m fc+s _i or 

m k = ... = m k+s _ 2 = «ifc+s-i + 1 or 

m k - 1 = m k+ i = ■■■ = m k +s-i- 

(ii) There is at most one block of length s > 2 on which the rrii are not all equal. 

(iii) If there is a block J of length s on which the rrii are not all equal then £ 1 — 

s-1 
s ' 

Proof. Let Wi(a) = x for i G J. By Proposition 2.1.1 (iii) we have 

k+s-l k+s-l 

E |$H 2 = £ e - 2<L 

i=k i=k 

Thus {m k , . . . , m k+s _i} can contain at most two different integers, which must be 
neighbors if they are different, say m, m + 1. We can also clearly assume that m < 
4^ < m + 1. Therefore (i) holds when s < 4. 
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So suppose that s > 4 and assume that m + 1 occurs t times. Set v = — n G [0, 1), 
where n G Z. Then the squared error on this block is 

fc+s-l 2 fe+t-l fc+s-l 

E |*-H = E i«-ii 2 + E h 2 

i=fc i=fe i=k+t 

= t(v - l) 2 + (s - t)v 2 

> for all v G (0, 1) if t € {2, . . . , s - 2}. 

But 2(s — 2)/s > 1 when s > 4. This proves (i). Parts (ii) and (hi) follow from the 
fact that the minimum squared error on a block of length s on which the mi are not 
all equal is 1 — \ > \. □ 

The following lemma will also be important for detecting potentially obstructive 
solutions (d; m). 

Lemma 2.1.8. Let (d;m) £ £ be such that fi(d;m) > for some a with 1(a) = 
£(m) = M. Let w^+i, ■ ■ ■ ,Wk+ s be a block, but not the first block, of w(a). 

(i) If this block is not the last block, then 



\m k -(m k+1 -\ \- m k+s + m k+s+ i)\ < y/s+~2. 

If this block is the last block, then 

\m k - (mfc+i H h m k+s )\ < Vs + 1. 

(ii) Always, 

M 

m k — m i < VM — k + 1. 

i=k+l 

Proof, (i) We prove the first claim, the second claim is proven in the same way. By 
definition (2.1) of the errors, rrii = ^=«>j + Ei for all i. Since w k+ \ = ■ ■ ■ = w k+s and 
w k+s+ \ = w k — sw k+ i, we have that 

m k - (m k +i H h m k+s + m k+s+1 ) = e k - (e k+ i H h e k+s + e k+s+1 ) , 

and so 

\m k - (m k +i H h m k+s + m k+s+1 )\ < \e k \ + \e k+1 \ H h |efc+ s | + |£fc+ s +i| • 

Since e ■ e = J2 £ i < 1 by Proposition 2.1.1 (hi), the latter sum is < \fs + 2. 

(ii) If the block iWfc+i, . . . , w^+s is the last block, then M = k + s, and so the stated 
estimate is the same as in (i). So assume that w k +i, ■ ■ ■ ,w k + s is not the last block. 
Since w k = sw k+ i + w k + s +i, we then have w k < Yli^k+i w i- Therefore, 

M M M M 

m k < -e k + ^2 (rrii + ei) < ^ mj + ^|ei| < ^ m, + \[M - k + 1, 

i=k+l i=k+l i=k i=k+l 
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as claimed. □ 



2.2. Some identities for weight expansions. This subsection establishes some 
rather surprising identities for weight expansions. 

Let a > 1 be a rational number and consider its continued fraction expansion 



(2.4) a := [£ ;h,...,£ N ] 



e 2 +... 

Usually, to avoid ambiguity, we assume that £n > 2, but in this section only it is 
convenient to permit the case £jv = 1 as well. We define the sequence a a := (a")^, 1 
by setting «q = 1, af = —£q, and 

(2.5) a] = a]_ 2 - i^a^ j = 2, . . . , N + 1. 

Similarly, define f3 a = (Z^)^ 1 by the same recursive formula, but starting with /3q = 0, 
Pi = 1. Thus f3 a does not depend on £$. Both sequences alternate in sign. 

We chose the notation a a , f3 a for these sequences because, as we now see, they are 
the coefficients of the linear functions Wi(z) for z lying on the appropriate side of a. 
Write the weight expansion w(a) of a as 

10(a) = (l x£o ,(x 1 (a)) x ^,...,(x w (a)) x£ -) = ( l^^l, , . . . ) . 

to t\ 

Then the weights Xj := xj(a) satisfy the recursive formula (2.5) with xq = 1, and 
x\ = a — £q, so that Xj = a a - + af3j for j < N. If N is odd, this formula for Xj(z), 
j < N, continues to hold for all z < a that are so close to a that the Xj(z) are positive. 
In this case, z = [£q; £i, . . . , £n , h, . . . ] where h > 1. Similarly, if N is even, it holds for 
z > a and sufficiently close to a. This proves the following result. 

Lemma 2.2.1. Let a and N be as above. Then if N is odd there is e > and h > 1 
such that for z £ (a — e,a) we have 

w(z) = (l^,(x 1 (z)) xe \...,(x N (z)) xeN ,(x N+1 (z)) xh ,...) 

where Xj(z) = a® + z/3j for j < N + 1. If N is even, the same statement holds for 
z £ (a, a + s). Moreover, in both cases Xj(z) is an increasing function of z for j odd, 
and a decreasing function for j even. 

We now give a second description for the sequences w(a), a a , and (3 a in terms of 
the convergents of a related number a (the mirror of a) which helps to explain their 
symmetry properties. 

Definition 2.2.2. Let £j > 1 for < j < N, where N>1. We define 
a := [£q;£i, ■ ■ ■ ,£n], a := [£n;£n-i, ■ ■ ■ ,£q], 



22 DUSA MCDUFF AND FELIX SCHLENK 

and call a the mirror of a. The convergents (a)k, 0<k<N,toa are defined by 
setting 

(ajfc := 4;«i,...,4 =: — =: — , 

where pk(a), qk(a) are the numerator and denominator of the rational number repre- 
sented by (a)fc. In particular, (a)jv = Pn/qn = a, and /or s/iort we write a = a. 
We define the normalized weight sequence of a as 



W{al) :=q N w(a)=((X (a)) xeo ,...,(X N (a)y 



In particular, Xn = 1 always. Finally, we define the (signed) mirror of a sequence 
W:= (X* to ,xf\...,X* iN ) to be ' 

W := {-X N - x y t »-\. . . , ((-l) N X ) xe °) 

where we reverse the order and change signs. 

Note that the sequence of weights w(a) is independent of the ambiguity in the lj, 
but its block description and the Xj do depend on this choice. 

The first part of the next lemma is well known. We then show that the normalized 
weights of a = a are equal to the numerators of the convergents of a. Further the 
coefficients a a (resp. f3 a ) are the numerators (resp. denominators) of the convergents 
of a = a shifted by 1. Define p~i(a) = 1, q~i(a) = 0. 

Lemma 2.2.3. Let a = a be as above. Then: 

(i) a = pN(a)/pN~i(a); in particular, pjv(a) = pjv(a). 

(ii) For < j < N , we have 

Xj(a) = \a%f-j\, X N _j(a) = \af \. 
Further |a"| = pj^\(a) for all a and < j < N + 1. 

(iii) Define u = u := [4; . . . , 4v]- Then u = (a)jv-i, and for \ <j<N + \we 
have 

\p»\ = \a]_ 1 \ = q j - 1 (a). 
Proof. The following matrix identity holds by induction on k: 

( 4 1\ / Pk(a) p k -i(a)\ 



(2.6) 




1 0/ \qk(a) q k -i(a) 



(i) follows by considering its transpose. 

To prove (ii), observe that the elements Xj := Xj(a) are decreasing positive integers 
with Xn = 1, Xn + i = 0, and by Definition 1.2.5 may be defined backwards by the 
iterative relation 

XN-j-i = Xjsr-j+i + Zn-j Xjsr-j, j > 1. 
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Similarly, a$ = 1 and, if we set a_i = 0, the identity (2.5) implies that the \a°j\ are 
increasing positive integers satisfying 

K + il = K-il+^KI' j>o. 

But a = [£' ; £' N ] where ^ = £ N -j. Therefore X 3 (a) = \a%_j\ for j = 0, . . . , AT. 
Since a = a, we also have that Xj(a) = |a$- -| for j = 0, . . . , N. 

To understand the relation between a°j and := pj_i(a), note first that po = 
lo = \a\\. Further, (2.6) implies that pj = Pj-2 + ^jPj-i- Therefore \a°j\ = pj-\(a) for 
j = 0, . . . , N + 1, as claimed. This proves (ii). 

The first claim in (iii) is obvious. To prove the second, note that the identity 
= follows immediately from the definitions of a and (3. Therefore, by (ii), 

= \a^_]\ = pj^2{u). But if v := [0; £\, . . . , £n], then ^ = v = a — [a\. Hence 
Po(u) = qi(v) = qi(a), and more generally, p k (u) = q k +i(a). Hence = q k -i(a). □ 



Corollary 2.2.4. 

W(a) ■= ((^(S)) x£o ,(-^_ 1 (S)) x£l ,...,((-l)^ (S)) x£ -) 

= ((aS)^,(a?) X ^,...,(a^) X ^) 

We now show that there is a mirror version of the quadratic relation w(a) -w(a) = a. 
Since W(a) = q]y(a) w(a), this relation is equivalent to the identity 

W{a)-W{a) = p N {a)q N {a) 

We need the following lemma. 

Lemma 2.2.5. Let b G Q have weight expansion w(b) = (l x£ °, (yi) x£l , ■ ■ ■ , (yN) x£N ), 
and set hn+i '■= 0. Then, if N = 2 J + 1 is odd 

J J 
V2k = ^2 £2j+i V2j+i, V2k+i = 2/2,/+ 1 + ^2 £23 V2j for each k > 0, 

j>k j>k 

while if N = 2 J is even 

J J-i 

V2k+1 = ^2 £2j V2j, V2k =V2J + V2 i +1 f° r eaCfl k ^°- 

j>k j>k 

Proof. This follows immediately from the construction of the weight expansion as in 
diagram (1.3); interpret each sum as the length of a side of some subrectangle. □ 

Proposition 2.2.6. Let a = [£q; . . . ,£n]- Then W(a) ■ W(a) = pjv(a) if N is even, 
and = if N is odd. 

Proof. Write 

W{a) := (X* e », Xtf»), W(a) := {Y^ , 
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Then W(a) = (Y* E ° , . . . , {(-l) N Yo) xttf ) . Hence Lemma 2.2.3 part (i) implies that 

(2.7) p := PN tf) = £ X + X 1 =£ N Y + Y 1 . 

Suppose first that N = 2 J. We must show that 
J j-i 

(2.8) ^ ^3 Y 2(J-j) X 2j =P+^2 y 2(J-i)-l X 2j+1- 
3=0 3=0 

Lemma 2.2.5 implies that 

k 

(2.9) ^2(J-fc)-i = ^2 hj Y 2(J-j) for each k < J. 

j>o 



Hence 



J-l J-l 



^2l2j+lY 2 (j- j )- 1 X 2 j + l = ^^ 2 j+1^2j+l(^^2s^2(J- S ))) 

jr'=0 S<j 

J-l .7-1 
^2 {^2s Y 2 (J- S ) tlj+l X 2j+l)) 



3=0 3=0 s<j 

J-l J-l 



s=0 j>s 
J-l 

^ hs *2(J-s) (-^2s - ^2j) 



s=0 
J-l 



^ ^2s ^2(J-s) -^2s - -^2J^1, 



s=0 

where the penultimate equality follows from Lemma 2.2.5 and the last one uses equa- 
tion (2.9). Therefore equation (2.8) will follow if we show that (£nYq + Y\)Xn = P- 
Since Xn = 1, this holds by equation (2.7). 

The proof when N is odd is similar, and is left to the reader. □ 

Corollary 2.2.7. Let Xj := Xj(a), where a = [£o',h, ■ ■ ■ An] > 1 and define ctj,(3j as 
in equation (2.5). Then: 

(i) // N is even, J2f=o ^3 x 3 a j =a an d J2f=o tj x j Pj = 0/ 

(ii) // N is odd, Zf=o lj *j (*j = and £j Xj ^ = 1. 

Proof. The sums involving a have the stated value by Proposition 2.2.6 and Corol- 
lary 2.2.4. To prove the claims involving /3 a , write 



u 



— [h'A2, ■ ■ ■ An] — [(■'oA'i-, ■ ■ ■ A'n']i 
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where N' = N - 1. Note that X^u) = X i+1 (a), for < i < N' . Thus, with 0g 
and since = af where i = j — 1 by Lemma 2.2.3 (iii), we find that 



N 



q N (a)J2 e j x j( a )Pj 

3=0 



N 

E- 



i=0 



By what we have already shown, this sum is when N' is odd (i.e. N is even) and 
equals the numerator pn'{u) of u when N' is even (i.e. N is odd). But pn'(u) is 
the denominator of \ = [0; £±, . . . , £n] = a — [a\, and so equals (?jv(a). The result 
follows. □ 

2.3. The nature of the obstructions. We saw in Corollary 2.1.4 that near each 
point a where c(o) > y/a, the function c is the supremum of a finite number of piecewise 
linear functions /j,(d; m), and that each linear segment of c has the form z >->■ a+fiz with 
rational and nonnegative coefficients. The next example shows that the coefficients of 
the functions fx(d; m), though rational, are not restricted in this way even if we suppose 
that fi(d;m)(z) > \[z. 

Example 2.3.1. Consider the class (d;m) = (l0;4 x6 ,l x5 ) in S. (Under the name 
E(a2), this class will play a role in Section 3.) Abbreviate fj,(z) = fi(d;m)(z). We 
compute dfi(z) = 10 fi(z) on the interval [6,6^]. 



on 


h = 


[6,6|] : 


-6 + 5z 


on 


[6 ,6±], 


25 




on 


[H 


63 


on 


h = 


[6i,6|] : 


Az 


on 


[63.6?]. 


44- 


3z 


on 


[6f 


65 


on 


h = 


[6|,6f] : 


-13 + 6z 


on 


[6|,6|], 


38- 


2z 


on 


[61 


61 


on 


h = 


[6|,6i]: 


6 + 3z 


on 


[6|,6f], 


51 - 


Az 


on 


[6f 


65 



see Figure 2.1. The figure also shows the graph of \fz and of c(z) (dashed) on [6, 6^] , 
which by Theorem 1.1.2 (i) is 

c(z) = § on [6,6|] , c(z) = \z on [6|,6|] . 



Note that £(ra) = £(a) = 11 at 6|, 6|, 6|, 6|. Also note that at a = 6| we have 
//(a) > while at a = 6| we have jiia) < \fa. 

Similar results hold for the functions fi(d;m) given by the classes E(a n ), n > 2, of 
Theorem 3.1.1. For example, one can use Corollary 2.2.7 to show that these functions 
equal c(z) for z near a n . O 

We now show that although the coefficients a, j3 may be negative, they are somewhat 
restricted. 

Proposition 2.3.2. Let (d; m) £ £ and a € Q be such that £{m) = £{a) and 
fi(d;m)(a) > y/a. Write a =: p/q in lowest terms, let m := mu be the last nonzero 
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Figure 2.1. The graph of ji on [6, 6|] . 



entry in m and let I be the connected component of the set {z \ fj,(d; m)(z) > *Jz} that 
contains a. Then there are integers A < p and B < (m + l)q such that 



dfi(d; m)(z) 



\ A + Bz if z < a, z € I, 

[ (A + mp) + (B — mq)z if z > a, z € I. 

We begin the proof by establishing the following lemma. 



Lemma 2.3.3. Consider (d;m) £ £ and a = | (in lowest terms) such that £(m) = 
£(a) =: M. Let d[i(d; m)(z) = A + Bz on a nonempty interval of the form (a — e, a). 
Then there is e' > so that for z' € (a, a + e') 

dfi(d;m)(z') = A + Bz' + m(p - qz') =: A' + B'z'. 

Proof. Suppose first that N is odd. Then by Lemma 2.2.1 when z < a, Xj{z) = aj+zf3j, 
for j < N + 1. Further, 

(2.10) x N+ i(z) = x N _ 1 (z) - l N x N (z) = a a N+1 + zf3% +1 = p- qz 

where the last equality holds by Lemma 2.2.3. 

When z' is just larger than a, its iVth multiplicity is £n — 1, and £n+i is very 
big. (Since £n > 2 this still gives an allowed set of multiplicities.) Hence for such z' 
the formula for the linear functions Xj(z'),j < N, is unchanged, but now x' N+l {z') = 
xn-i{z') — (£n — ^)xn(z'). (For clarity we denote by x' N+1 the formula that holds for 
z' > a and by xn+i the formula that holds for z < a.) Note that because £{m) = £(a), 
just one term from the (iV+l)st block is counted in /j,(d; m)(z'). Hence, with m := tum, 
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we have 

dfj,(d; m)(z') — (A + Bz) = —mxiy(z') + mx' N+1 (z') 

= m(-x N (z') + x N -i(z') - (£ N - l)x N (z')) 

= m(x N -i(z') - £ N x N (z')) = m(p - qz'), 

where the last equality uses equation (2.10). 

Now suppose that N is even. Then the formulas Xj(z') := a® + z(3j give the (be- 
ginning of the) weight expansion for z' just larger than a. As above, when z is just 
less than a, we must modify the last multiplicities of a, reducing In by 1, and making 
In+i arbitrarily large. Thus as above, the formulas for the weights Xj(z), j < N, are 
unchanged but that for the (N + l)st weight is modified. As above we denote by x' N+1 
the formula that holds for z 1 > a and by xjv+i the formula that holds for z < a. Then 
x' N+1 (z') = — p + qz' > 0. Further, if dfj,(d;m)(z') = A' + Bz' for z' > a, we find for 
z < a that 

dfj,(d; m)(z) — (A' + B'z) = — mx^{z) + mxN+i{z) 

= m(-x N (z) + x N -i(z) - (£ N - l)x N (z)) 
= mx' N+1 (z) = -m(p — qz). 

Therefore A + Bz = A' + B'z — m(p — qz), as claimed. □ 

To complete the proof of Proposition 2.3.2 we need to estimate the size of A, B. Here 
is an auxiliary lemma. 



Lemma 2.3.4. Let £q',£i, ■ ■ ■ ,£n be any sequence of positive integers with £n > 2, and 
let rjj, j > 0, be one of the sequences Then X^jLo^'l^il 2 < ^Iw+il 2 - 

Proof. By definition rjj = rjj-2 + lj-\T]j-\. The inequality 



(2.11) ME^ 2 ] 

holds for k = 0, and may be proved for all larger k by induction. Setting k = N yields 
the lemma. □ 

Proof of Proposition 2.3.2. Suppose first that iV is odd, and write rrtj = -^Wi{a)+Si 
as in equation (2.1). For notational convenience, let us first assume that the nii are 
constant on each of the blocks of length £j. Then define rij to be this constant value 
on the j'th block. If this assumption holds, then the are also constant on the blocks, 
and we denote their values by Sj. Then dfi(m; d)(a) = A + Ba where, by Lemma 2.2.1, 
we have 
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Therefore, substituting nj = + Sj, we find 

A = E *i "i n 'j = E 7a*3 X i a i + E 

= o+Evk 

^ (E^f) 1/2 (E^Ki 2 ) 1/2 < v^i^+ii < v. 

Here we used Corollary 2.2.7 for the third equality, and for the inequalities used the 
Cauchy-Schwarz inequality, Yl^j^j ='■ E < I from Proposition 2.1.1, Lemma 2.3.4 and 
finally the fact that , 1 | = p from Lemma 2.2.3 (ii). 

This argument is also valid if the are not constant on the blocks. In this case, 
by Lemma 2.1.7 the values of rij and Sj may vary by 1 over the entries of one block, 
but that variation can be absorbed into the sum that gives y/~E and will not increase 
it above \/E + 1 < \/2. 

Similarly, 

= * + EWi =--js + s, 

where S := Ylj<N ^i'^ifi'j ■ By definition, m = tum = ~4^ X N + $N, and = |. 
Therefore, assuming that the m; are constant on the blocks we have 

B - (m + l)q = ^=-qm-q + S= -q(l + 5 N ) + S. 

We need to show that S < q(l + Sn) = \Pn+i \ (1 + If &N > we may estimate S 
as before by 

5 < v^(E^(/?i) 2 ) < TTll^+il < «■ 
i=o 

Now assume that <5at = —5 is negative and note that > because iV is odd. 
Therefore 

_ _ N-l 1/2 

^ == E W° ^ "^^N + E e 3 ^ ) 2 ) ^ ^ " M ' 
j<N j=0 

where we used equation (2.11) with k = N — 1 and £n-i > 1- Since < |/3^ +1 |/2 = 
g/2 by the inductive formula, the desired result follows easily. 

Suppose now that the rrij are not constant on the jth block. If j < N, then, as 
before, we simply need to replace E by E + 1 in the above estimates. It is easy to check 
that the argument still goes through. 

It remains to consider the case when the mi are not constant on the last block. 
Define 6n again by m = mu = 4w + $N- By Lemma 2.1.7 the last block of m is 

either (m + l) x£ , m with errors (5n + l) x ^, Sjy', or m + 1, m x ^ with errors Jjv + 1, <^> 
where £ := £n — 1- Note that 5/v =: — <5 is negative. The sum 5jv of 5j/3? over the 
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last block is either (£(1 -5)- 5) (3% or ((1 - 8) - £5) (3 pj. Since £ > 1, in either case 
Sn < (^(1 — 8) — 8) (iff. But because (£ + 1)(3% < = Q-> we can estimate 

— (m + l)g as follows: 

B -(m+l)q = - q (l-6)+J2 j<N £ j 8 j P« + S N 

< -ffi((£ + l)(l-8)-VE-£(l-8)+8) <0. 

This completes the proof when N is odd. The case when N is even is similar, and is 
left to the reader. □ 

2.4. Connection to the lattice counting problem. In this section we prove The- 
orem 1.1.3, stating that cech (a) > c(a) for all a > 1. Recall that for a > 1, 

CECH(a) := inf {n > | N(l, a) 4 N(p, fj,)} . 

The first step is to describe cech in another way. As Hutchings pointed out 5 , the 
inequalities N(l,a) =4 N(fi,fi) can be understood in terms of counting lattice points 
in triangles, as follows. Let a > 1 be irrational. For each pair of integers A, B > 0, 
consider the closed triangle 

T% B := {(x,y) e M 2 | x,y > 0, x + ay < A + aB}. 

Thus the slant edge of T\ B has slope — \ and passes through the integral point (A, B). 
Then the number #(T^ B nZ 2 ) of integer points in the triangle T\ B is just the number 
of elements in N(l, a) that are < A + Ba. We define 

(2.12) k AB (a) ■= 
where d is the smallest positive integer such that 

#(TX B nZ 2 ) < l(d+l)(d + 2). 

(Note that N(l, 1) = (0, 1, 1, 2, 2, 2, 3, 3, 3, 3, 4, . . . ) has precisely ±(d+l)(d + 2) entries 
that are < d.) Further, set 

K(a) := sup {k AtB {a)} . 
A,B>0 

We extend the function K to rational a by defining 

(2.13) K(a) := sup K(z). 

z<a, z irrat 

Lemma 2.4.1. K{a) = cech{o) for all a > 1. 

Proof. For each A > 1 we have N(l,Xa) ^ XN(l,a). Therefore, the conclusions of 
Lemma 1.1.1 hold for c B ch as well as for c. In particular, cech is continuous and 
nondecreasing. Therefore, (2.13) also holds for c B ch- It hence suffices to prove the 
lemma for irrational a. 

Fix an irrational a. If cech{o) < K(a) then one can find a rational number fi > 
cech{o) and non-negative integers A, B with fi < kA,B(a)- Since fi > cech(cl) we have 

^Private communication. 
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N(l,a) =4 N(fi,fj,). This inequality implies that for all non-negative integers A, B we 
have 

#{p£ iV(l, a) I p < A + Ba} > #{p£ N(p, p)\p<A + Ba}. 
The number on the left is #(T% B r\T?) , while the number on the right is \{D+\)(D+2), 
where D := [ ^^" J • This must be a strict inequality for some A, B. To see this, let u = 
(u\,U2,us, . . . ) be the sequence of natural numbers obtained by arranging in increasing 
order all the numbers on the LHS obtained by running through all pairs of integers 
A, B > 0. Since a is irrational, each number in N(l, a) occurs with multiplicity 1. The 
definition of N(l,a) therefore shows that u = (1,2,3,...). On the other hand, the 
numbers in N([i, fi) appear with larger and larger multiplicity. The sequence obtained 
in this way from the RHS therefore jumps by larger and larger amounts. 

Consider A, B such that this is a strict inequality. Then k Aj B{a) = ~~^r^ where 
d > D. On the other hand because a is irrational and \x is rational, D + l > > D, 

(J, 

so that 

k A , B {a) = ^ < 4^ < 

Since this contradicts our assumptions, we conclude that cech(o) > K(a). 

To complete the proof, it suffices to show that K(a) > jj, for all \i < cech{o)- For 
such fi we have N(l,a) ^ N(p,[i). Therefore there is A, B such that 

#{pe N(l, a) | p < A + Ba} < #{p£ NQi, n) \ p < A + Ba} . 

With D as before, this implies that d < D, so that k A ,B( a ) = ^ ^TT 1 - V- 

Hence K (a) = sup/cA,_e(«) > A* as required. □ 

We are now going to prove Theorem 1.1.3 by direct calculation, showing that for each 
of the constraints (d; m) that contributes to c(a) there is a triangle that contributes to 
K(a) in exactly the same way. Therefore we will assume the results of Theorems 1.1.2 
and 5.2.3. 

The key to understanding the relation between the functions k A ,B of equation (2.12) 
and the number of lattice points in the triangles T\ B is the following lemma, that was 
explained to us by Hutchings. 

Lemma 2.4.2. Suppose that a is rational, abbreviate T := T\ B , and suppose that 
#(TnZ 2 ) < i(d + l)(d + 2) + s-l = i(d 2 + 3d) + s, 

where s > 1 is the number of integral points on the slant edge ofT. Assume that (A, B) 
(resp. (A',B')) is the integral point on the slant edge with smallest (resp. largest) x- 
coordinate. Then there is e > such that 

K(z)>^f- ifze(a-e,a\, K(z)>^^- ifz€[a,a + e). 

Proof. Recall that cech and hence K is continuous. To prove the statement for z < a 
it therefore suffices to consider irrational z of the form z = a — e. Then, for small 
enough e > 0, the triangle T| B contains s — 1 fewer integral points than T. Therefore 
k A ,B{z) > A+ d B ' ) which proves the first statement. Similarly, the second statement 
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holds because if z = a + e is irrational and e > is sufficiently small, the triangle T|, B , 
contains s — 1 fewer integral points than T. □ 

Lemma 2.4.3. K(b n ) > ^/a n+ \ for all n > 1. 

Proof. Consider the triangle T n C M 2 with vertices (0,0), (g n+ 2,0) and (0,g n ), where 
g n is the nth odd Fibonacci number. Because g n ,9n+2 are mutually prime and satisfy 
the identities 

9n + 9n+2 = 3g n+ i, g n 9n+2 = 9 n +i + 1 
(see Section 3.1), we find that 

(2.14) #(T n nZ 2 ) = \{g n + l)(^ n+2 + 1) + 1 

= 5(Sn+l+3#n+l) + 2. 

Since 6 n = we have T n = Tg" n . In view of (2.14) we can apply Lemma 2.4.2 with 
s = 2 and d = g n +i- For some e > we have 

K{z) > ff- when ze(b n -e,b n ], 

and 

K(z) > when [6 n ,6 n + e). 
In particular, X (6 n ) = ^ = ^/a^T. □ 

Corollary 2.4.4. K(a) > c{a) for all a G [l,r 4 ]. 

Proof. First observe that c is the smallest continuous and nondecreasing function on 
[1, t 4 ] that is > y/a, has the scaling property of Lemma 1.1.1, and also satisfies c(b n ) = 
yjttn+i. On the other hand, we already remarked that cech and hence K is continuous, 
nondecreasing and has the scaling property. It is also easy to see that K{a) > y/a, 
because the number of integer points in a large triangle approximates its area. Therefore 
K(b n ) > c(b n ) implies that K(a) > c(a) over the whole interval. □ 

Proof of Theorem 1.1.3. By Corollary 2.4.4 we only need to show K > c on the 
interval [r 4 ,oo). Since, as remarked there, K(a) > y/a for all a, we just need to check 
that K(a) > u(d;m)(a) for all (d;m) that contribute to c. Recall from the proof of 
Proposition 1.2.9 that the class (3; 2, l x6 ) gives the constraint ^jjp- on [r 4 , 7]. Together 
with Theorems 1.1.2 (ii) and 5.2.3, we see that it suffices to check K(a) > fi(d;m)(a) 
for the nine classes in Table 2.15 below. Each of these classes contributes on both sides 
of its center point. It suffices to show that in each case there is a triangle that gives 
an equal constraint. Proposition 2.3.2 shows which triangles to take: if the constraint 
(d; m) is centered at a, then one should consider T := T\ B = Tj[, B , where u(d; m)(z) 

equals ^(^4 + Bz) to the left of a and ^(^4/ + B'z) to the right. Because c = /j,(d;m) 
in a neighborhood of the center point, this proposition together with Corollary 2.1.4 
implies that < A < p and mq < B < (m + l)q, so that the integral points (A, B) and 
(A 1 , B') = (A + mp, B — mq) are the first and last on the slant edge of T, as required by 
Lemma 2.4.2. Therefore, it suffices to check that in each case the coefficient d occurring 
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in (d; m) satisfies the condition in Lemma 2.4.2. Thus the number N(A, B) of integer 
points in T must be < N(d) := ^(d + l)(d + 2) + s — 1, where s = m + 1 is the number 
of points on the slant edge of T. In fact, as the following table shows we find that 
N(A, B) = N(d) in each case. 



(2.15) 



a 


(a; m) 


(A,B) 


(A',B') 


N(A,B) 


s 


TV T / 7\ 

JV(d) 


7 


(3;2,l xb ) 


(1,1) 


(8,0) 


11 


2 


11 


rrl 


/ /i o 1 ox7 o nx7\ 

(48; 18 ,3, 2 X 'J 


(7, 17) 


(121,1) 


1227 


3 


1227 


' 15 


(64;24 x7 ,3 x7 ,l x2 ) 


(14,22) 


(121,7) 


2146 


2 


2146 




(24;9 x7 ,2, l x6 ) 


(7,8) 


(57,1) 


326 


2 


326 




(40;15 x7 ,2 x6 ,l x2 ) 


(14, 13) 


(107,0) 


862 


2 


862 




(16;6 x7 ,l x5 ) 


(7,5) 


(43,0) 


154 


2 


154 


7± 


(35;13 x7 ,4,3 x3 ) 


(0, 13) 


(87,1) 


669 


4 


669 


1 2 


(8;3 x7 ,l x2 ) 


(7,2) 


(22,0) 


46 


2 


46 


8 


(6;3,2 x7 ) 


(1,2) 


(17,0) 


30 


3 


30 




Figure 2.2. The subdivision of the triangle T\ B . 

Here we calculate N(A, B) by subdividing T into five parts labeled a, . . . , e as in 
Figure 2.2. Each part besides 7 is half open, and includes the integer points on the 
heavy boundary edges but not those on the dashed boundary edges. For example, 
the rectangle j3 includes the integer points on the x and y-axes, but not those on the 
(dashed) edges shared by a, 7 or 5. Thus #(/3nZ 2 ) = AB. Further, because A,B 
(resp. (A',B')) is the integer point on the slant edge with smallest (resp. largest) x 
coordinate, we put all integer points on the slant edge into 7. Thus, we find that 



#( 7 nZ 2 ) = i((A'-A + l)(B'-B + l)-s}+s. 
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For example, in the case of the triangle Tf 17 with a = 7| and s = 3, the numbers of 
integer points in a, . . . , e are 7, 119, 979, 114 and 8, giving a total of 1227. 

This completes the proof of Theorem 1.1.3. □ 

Remark 2.4.5. (i) On the interval [7,8], there are four other classes (described in 
Table 5.14) with the property that fi(d;m)(a) = c(o) at their center points a = |, but 
that do not contribute otherwise to c(o). Let us look at their contribution to K. In 
each case (A, B) = (—1, mq), where m is the last nonzero entry in m, so that fj,(d; m) 
does not satisfy the scaling condition to the left. In the corresponding triangles the 
first point on the slant edge is (A\,Bi) = ( — 1 + p, (m — l)q) and one can check as 
before that in each case s = m and N(A, B) = N(d). Therefore to the left of each 
center point we obtain the inequality 

K{a) > k AljBl {a) = . 

In each case, one can check that kA 1 ,B 1 (a) is precisely c(a). For example, at 7| = ^ =: 
|, we get 56 \l 7 A 8a = I± § SL which agrees with the first line in Table 5.3. 

(ii) Recall from the introduction that the expected functorial properties of embedded 
contact homology should establish that cech{o) < c(a) for all a. For a < 6^, a = 9 
and for a > 11 one can prove this by directly showing that for each d the closed triangle 
with vertices (0,0), (dc(a),0), (0,d^-) contains at least i(c/ + l)(d + 2) lattice points. 
For other values of a the lattice point formula in Theorem 2.10 of [1] should be useful. 
This is work in progress. O 

3. The Fibonacci stairs. 
In this section we establish the behavior of c(a) for a < r 4 . 

3.1. Main results. Recall that the Fibonacci numbers f n for n > are recursively 
defined by 

(3.1) /o = 0, /i = 1 and f n+1 = f n + / n _i, n > 1. 

Denote by g n = fm-x-, n > 1, the sequence of odd Fibonacci numbers. The sequence 
g n starts with 

1, 2, 5, 13, 34, 89, 233, 610, 1597, 4181, 10946, . . . 
The recursion formula f n +i = fn + fn~i implies the recursion formula 

(3.2) g n+ i = 3g n - g n -i. 
Using this and induction we find that 

(3.3) g 2 n + 1 = g n -i9n+i- 
Set 



°« = (if) and b 



3n + 2 

n — „ 
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Then ■ ■ ■ < a n < b n < a n+ \ < b n+ \ < . . . . Since lim = ~v =: r, we have 

lim a n = lim b n = r 4 ~ 6.8541. 

n— >oo rt— >oo 

They key to establishing the Fibonacci stairs is the following result that states that 
there cire elements in S corresponding to the points (i n) b n . 

Theorem 3.1.1. (i) Let W(b n ) = g n w(b n ). Then E{b n ) := (g n+1 ; W(b n )) G £. 

(ii) Let W'(a n ) be the tuple obtained from W(a n ) := g^w(a n ) by adding an extra 1 
at the end. Then E(a n ) := (g n g n +i', W'(a n )^ G £. 

Corollary 3.1.2. Part (i) of Theorem 1.1.2 holds. 

Proof of Corollary. Since E(b n ) is a perfect element, Lemma 2.1.5 (i) shows that 
c{b n ) = »(g n+1 ;W(b n ))(b n ) = J^- w { bn )-w{b n ) = J^b n = = 

Suppose that c(a n ) > ^fa^ for some n. Then Corollary 1.2.3 implies that there is 
(d; m) e £ such that 

m ■ w(a n ) > d^fa~^. 

Note that (d;m) / (g n g n +i; W'(a n )) since W'(a n ) • w(a n ) = g\a n = g n g n +i y/a^- 
Therefore by positivity of intersections (part (ii) of Proposition 1.2.12) we must have 

dg n g n +i > m-W'(a n ) > glm ■ w(a n ), i.e. d^fa^ > m-w(a n ). 

It follows that c(a n ) = y/a^ for all n. Thus c{b n ) = ^a n+ \ = c(a n+ i). Moreover, 

c(b n ) _ y/an+l __ 1 __ c(q n ) 

bn bn yj CLn Q>n 

Hence c is linear on the interval [a n , b n ] by the scaling property. □ 

Corollary 3.1.3. The classes E(b n ) are the only perfect elements. 

Proof. On [1,t ], c(a) is given by the Fibonacci stairs. By Lemma 2.1.5 (i), the perfect 
element E{b n ) is the only class giving the constraint c(b n ) at b n . This and Proposi- 
tion 2.3.2 show that the step of the stairs over [a n ,a n +i] centered at b n is the con- 
straint n given by the perfect element E{b n ). Lemma 2.1.5 (i) now shows that there 
cannot be another perfect element on [l,r 4 ]. By (ii) of Lemma 2.1.5 there is no perfect 
element on [r 4 ,oo). □ 

We now turn to the proof of Theorem 3.1.1. The proof of part (i) is relatively 
easy; it is deferred to Corollary 4.2.3 since it is a special case of Proposition 4.2.2. To 
prove part (ii) we first need to show that the elements E(a n ) satisfy the appropriate 
Diophantine equations, which is accomplished in Lemma 3.1.4. Second, we must check 
that E(a n ) reduces correctly under Cremona moves. As we see in Section 3.3, the 
reduction process is quite complicated (and in fact is much more complicated than for 
the E(b n )), basically because the weight expansions w{a n ) involve quadratic rather 
than linear functions in the Fibonacci numbers. The intermediate Section 3.2 collects 
basic identities on Fibonacci numbers and explains an inductive procedure useful for 
checking identities on them. 
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Lemma 3.1.4. The tuples E{a n ) := (g n 9n+i',W (a n )) have integer entries and satisfy 
equations (1.8). 

2 

Proof. Consider (g n g n+ i; W'(a n )) . Since a n = - JL ^-, it follows from Lemma 1.2.6 that 
the last entry wm of w(a n ) is \. Therefore, the terms in W(a n ) = g"lw(a n ) and hence 
in W'(a n ) are all integers. Next, 

2 2 2 

= g n+ i + g n = g n+ \ + g n +ig n -i - 1 
= g n +i (flW+i + g n -i) - 1 = 3g n+1 g n - 1. 
Finally, W'(a n ) • W'(a n ) = g^a n + 1 = {g n gn+i) 2 + 1- This completes the proof. □ 

3.2. Identities for Fibonacci numbers. The proof that the classes E(a n ) reduce 
correctly involves many small calculations. To avoid having to do them explicitly, 
we first explain a general inductive procedure whose conclusions are summarized in 
Proposition 3.2.3. It is based on the following elementary result. Recall that fa denotes 
the A;th Fibonacci number defined in (3.1). 

Lemma 3.2.1. Given any three distinct numbers So,si,S2 > 0, there are rational 
constants A, fx such that f S2 +j = ^fs +j + M/si+j f or a ^ 3 > 0. 

Proof. The equations 

A /s + M/si = fs 2 i ^fs +l + M/si+1 = /s 2 +l 

fs fs 

have a unique solution because , S() / , 81 when so / si- Now apply the defining 
relation (3.1). □ 

We will frequently use the following relations between Fibonacci numbers. 
(3-4) fl = fa + ifa-i-(-l) k 

(3-5) hk-i = fl + fl-i 

(3-6) f2k = fk+i ~ fk-i 

Lemma 3.2.2. For each i > and s > 0, there is an identity of the form 

fs+ifs = y^ J CLijf2s+j + ( — l) S Qi 

i>o 

with a finite number of coefficients Cj,Ojj G Q f/iaf do no£ depend on s. Further, 
Ci = — Sj>0 a ijfj- 

Proof. By (3.1) it suffices to prove this for i = and i = 2. We claim that for all s > 0, 

(3.7) bf s f s = -/ 2s + 2/ 2s+1 -2(-l) s , 

(3.8) 5f s+2 f s = f 2s+1 + f 2s+3 -3(-l) s . 
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For s = 0, equation (3.7) is true. For s > 1, equation (3.7) can be rewritten as 

5/ s 2 = / 2s+ i+/ 2s -i-2(-ir. 

By (3.5), the RHS is / 2 +1 + 2/ 2 + / 2 _i - 2(-l) s , whence we need to check 
(3-9) 3/ 2 = / s 2 +1 + / 2 _ 1 -2(-ir. 

Replacing / s+ i by f s + / s _i, this becomes 2/ 2 = 2/ s / s _i + 2/ 2 _i - 2(-l) s , which is 
true since by (3.4), / 2 = / s+ i/ s _i - (-l) s = + / 2 _i " (-I)*- 

By (3.4) and (3.5), equation (3.8) becomes 

5/ 2 +1 + 5(-l) s+1 = / 2 +2 + 2/ 2 +1 + / 2 - 3(-l)', 

3/ 2 +1 = / 2 +2 + / 2 - 2(-l)*+ 1 
which is true by (3.9). The formula for a holds because /o = 0. □ 

Proposition 3.2.3. A quadratic identity of the form 

Q(s) ■= a ijfs+ifs+j + ^2b j f 2s +j + (-l) s c = 

i,j>0 j>0 

holds for all s > if it holds for any three distinct values of s. Moreover, if the relation 
is homogeneous and linear (that is, if aij = c = for all i,j), then it suffices to check 
two values of s. 

Proof. Suppose that Q(s) = for s = s ,si,s 2 where < s < «i < s 2 - By 
Lemma 3.2.2 one can convert Q{s) to an equivalent identity of the form 

Q'(k) := ^ aj f k+j + (-1) V = where k = 2s. 

j>o 

We first claim that d = 0. By Lemma 3.2.1 there are constants fi, A such that 

(3.10) f 2s2 +j = M/2si+j + A/ 2so +i for a11 3 > 0- 

Since Q'(k) = for k = 2s ,2 Sl ,2s 2 , and by (3.10), 

If d 7^ 0, we thus have 1 G {±/U±A}. This is impossible: We use the recurrence 
relation (3.1) to extend the sequence (/ n ),n > 0, to negative index n. Note that 
/_„ = (-l) n+1 /n- Then (3.10) holds for all j G TL. With j = -2s and j = -2 Sl we 
get 

_ /2(s 2 -s ) > _ /2(s2-si) 

M-7 > A__ 7 • 

J2(si— so) J2(si—so) 

Therefore, ±/i ± A = 1 exactly if 

±/2(s 2 -s ) = i/2(s 2 -si) + /2(si-s )- 

The signs ++ are impossible because / m+n = / m+n -i + fm+n-2 > fm + fn for all even 
m,n > 0. Further, H — , — h, and are impossible because s 2 > si > sq > 0. 
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We have shown that Q'(k) = Sj>o a jfk+j- Recall that Q'(k) = for k = 2sq, 2si, 2s 2 . 
By Lemma 3.2.1, the expression for Q'(0) can be written as a linear combination of 
the expressions for Q'(2sq) and Q'(2si), and the same is true for Q'(l). Therefore, 
Q'(0) = and Q'(l) = 0. This and the defining relation (3.1) show that Q'(k) = for 
all k. In particular, Q'(k) = for all even k, and so Q(s) = for all s. This proves the 
first statement. The second holds similarly. □ 

In the subsequent sections, the following abbreviations will be useful. 

Definition 3.2.4. The kth Lucas number is defined to be i k = fk-i + fk+i, k > 1. 
We set F k := \f Ak and L k := \t ik+2 . 

Then 

F 1 = 1,F 2 = 7, F 3 = 48, F 4 = 329, 
L = 1, Li = 6, L 2 = 41, L 3 = 281, L 4 = 1926. 

Further for k > define the sequence H k by 

(3-11) H k = | f 2 kf2k+2- 

Then H = 0, H x = 1, H 2 = 8, H 3 = 56, F 4 = 385, H 5 = 2640, .... 

Lemma 3.2.5. The following identities hold for all k > 0: 

(i) F k+1 = L k + F k ; 

(ii) L k+ i = 5 F k+ i + L k ; 

(hi) -fffe+i = H k + F k+ i = Yli=i F H 

(iv) L k = 5H k + l. 

(v) F fc 2 +1 -F fc F fc+2 = l. 

Proof. The second identity in (iii) follows from the first identity in (iii) by induction. 
All other identities have the form considered in Proposition 3.2.3, and so it is enough 
to check each of them for at most three low values of k. □ 

3.3. Reducing E(a n ). We begin with a general remark about the reduction process. 

Remark 3.3.1. Consider a tuple (d; m) that satisfies the Diophantine identities (1.8). 
Proposition 1.2.12 states that (d; m) G £ exactly if it reduces to (0; — 1, 0, . . . , 0) under 
standard Cremona moves, as defined in Definition 1.2.11. In fact, it clearly suffices to 
reduce (d; m) to a known element of £ by any sequence of Cremona moves. Each such 
move consists of an application of the Cremona transformation 

(d; m) I—?- (2d — mi — m 2 — m^; d — m 2 — 7713, d — m\ — 7713, d — m\ — m 2 , . . . ) 

followed by a choice of reordering. It does not matter whether this reordering restores 
the natural order; all that is important is that in the end, after doing many such moves, 
we arrive at a known element of £. In fact, the reorderings chosen below all do restore 
the natural order. The point of this remark is that there is no need to prove this. O 
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Example 3.3.2. The first few elements E{a n ) are 

(9293;W'(a 2 )) = (l0;4 x6 ,l x5 ), 

(9394;W'(a 3 )) = (65;25 x6 ,19 xl ,6 x3 ,l x7 ), 

{94g 5 ;W'(a 4 )) = (442;169 x6 ,142 xl ,27 x5 ,7 x3 ,6 xl ,l x7 ). 

These values of n are too low for our general arguments in §3.3.1 and 3.3.2 to apply. 
Hence one proves that they reduce correctly by direct calculation. 

The following list of Fibonacci numbers will be useful in the subsequent proofs. 



n 





1 


2 


3 


4 


5 


6 


7 


8 


10 


12 


fn 





1 


1 


2 


3 


5 


8 


13 


21 


55 


144 



3.3.1. Reducing E(a n ) for even n. Throughout this subsection we will consider n = 
2m > 2 to be a fixed even number. We will obtain an explicit expression for W'(a n ) and 
then examine its reduction by Cremona moves. By Example 3.3.2 it suffices to consider 
the case n > 6. Hence this case of Theorem 3.1.1 follows from Propositions 3.3.6, 3.3.9 
and 3.3.10. 

For each fixed n, denote k' := n — k and define 

(3.12) u k := fl_ 2k _ 1 + 2H k = fl k ,_ 1 + 2H k k = 0, . . . , m - 1, 

v k := 3F n „ fc - 2F fc = 3F k > - 2F k , k = l,...,m. 

Note that u k ,v k depend on n, though for simplicity the notation does not make this 
explicit. Also, v k > for all k < m and v m = F m . However, the above formula for v k 
gives a negative number when k > m. This is why the expansion in Proposition 3.3.3 
below changes its form at the term v m . Note also that by equations (3.4) and (3.11) 

(3-13) /ln+1 = /2n/2n+2 + 1 = %H n + 1- 

Hence we also have 

(3.14) ^ = 3^ + 2^ + 1. 

Proposition 3.3.3. If n = 2m is even, then the continued fraction expansion of a n is 
[6; 1^5, 3, 1, VLJ =: [6; {1, 5} x (™- 1 ) , 3, 1, {5, 1}^' % 

m—l m—l 

and the (renormalized) weight expansion W(a n ) is (A n ,B n ), where the vectors A n ,B n 
are 

A n = (uo x6 ,vi xl ,ui x5 ,w 2 xl ,...,u m _ 2 x5 ,v m _ 1 xl ,n m _i x5 ), 

r> It? x3 t Xl TP x5 T Xl TP x5 r xl 77 x5 T x]\ 

o n — \r m ,J^ m -l i^m-X , , ■ ■ ■ , ^2 , -^1 ,-fl ,^0 )■ 

Remark 3.3.4. Although our usual convention is that the expression a = [£o; l\, . . . , £n] 
always has £n > 2, we relax this condition here in order to simplify the formulas. We 
allow ourselves another liberty at the end of these expansions: in the formula for B n 
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in Proposition 3.3.3 the last two weights F\, Lq are equal, so the ending multiplicity is 
in fact 6 rather than 5, 1. O 

Proof. Recall that a n = (sn±l \ 2 = h n+1 . Also, because 6 n _i < a n < b n , the weight 

V 9 n ) J2n-l 

expansion w(a n ) begins as (l x6 , (a n — 6) xl , . . . ). Hence 

W(a n ) = fl n -iw(a n ) = (/| n _i X6 ,/| n+ i - 6 fin-!, ■■■)■ 

Therefore, because v m = F m as noted above, the expansion of W(a n ) up to and includ- 
ing the term L m —\ follows from Lemma 3.3.5 below. The rest of the expansion holds 
by Lemma 3.2.5. □ 

Lemma 3.3.5. Let n = 2m > 4. The following identities hold. 

(i) /In+i =6u +v 1 and vi < u ; 

(ii) u k = v k+1 + u k+1 and u k+1 < v k+1 for k = 0, . . . , m - 2; 
(hi) v k = 5 u k + v k+l and v k+l < u k for k = I, . . . , m - 1; 
(iv) ii m _i = 3F m + L m _i and L m _i < F m . 

Proof. Statement (i) is equivalent to 

(3.15) fi n+1 = 6 + 3F n _! - 2. 

Since fg +1 = 6fg_ 1 + f2s-4 — 2(— l) s holds true for s = 2,3, 4, this equation holds true 
for all s > 2 by Proposition 3.2.3; in particular it holds true for all even s > 2, and 
so (3.15) holds true. 

The equality in (ii) follows from the definitions of u k ,v k by using (3.14) and H k = 
Ym=i Fi an d by dividing the equations into two equations, one for k and one for k' := 
n — k. To prove the equation in (hi) we again divide it into two equations, one for k 
and one for k' := n — k, namely 

-2F k = 10H k + 2 - 2F k+1 , 3F k , = 5/ 2 2 fc ,_! - 2 + 3F fc /_i. 

The first equation is equivalent to L k = 5H k + 1, which holds by Lemma (3.2.5) (iv), 
and the second holds because $2 S = 5/^_ 1 — 2(— l) s + /2 S -4 is true for s = 2,3, 4 and 
hence for all s > 2 by Proposition 3.2.3. 

Equation (3.14) and Lemma 3.2.5 imply that 

u m -i = 3H m + 2iT m _i + 1 = 3F m + 5iJ m _i + 1 = 3F m + L m -\. 

This proves (iv). 

Since u k ,v k are positive in the given ranges, the equalities in (ii) and (iv) imply the 
inequalities in (i) and (hi) . Similarly, the equalities in (hi) imply the inequalities in (ii) . 
This completes the proof. □ 

The reduction process has three steps that are described in Propositions 3.3.6, 3.3.9 
and 3.3.10. Notice, before we begin, that the weights of a n divide into three groups, 
namely (m — 1) pairs {1,5}, two central blocks with multiplicities 3,1, and finally 
(m — 1) pairs {5, 1}. In the first step we show that a set of 5 Cremona moves has 
the effect of moving the first {1,5} pair from the left to a {5,1} pair on the right. 
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Moreover, doing this introduces no new weights to the right while slightly modifying 
the first block on the left. 

Denote V n = E(a n ) = (g n g n+1 ;W(a n ), 1). Note that 

9n9n+l = /2n-l/2n+l = /In + 1 = /In + fl 

by (3.4). 

Proposition 3.3.6. For n = 2m > 6, the vector V n is reduced by 5(m — 1) Cremona 
moves to the vector = (/| (m+1) + /Im-i! A n> B n)> where 

A\: = ((u m _i +F m _i) xl , n m _i x5 ), 

73I . / TP X3 t Xl jp XlO r X2 TP XlO t X2 77 XlO r x2 ,xl\ 

D n . — \r m , L m -\ , P m -l , L m ^2 , ■ ■ ■ , ?2 ,^1 j r\ i^O 5 1 )■ 

Let 

V(n,l) := (fl n + l;A(n,l),B(n,l)) := (/| n + 1; A n , B n ) = V n , 

and for k = 2, . . . , m define the vector V(n, k) by (/|( n _ fc+1 ) 
where 

A(n,fc) = (K-i +F fc _ 1 ) xl ,u fc _i x5 ,v fc xl ,n fc x5 ,...,,v m _ 1 xl ,u m _i x5 ), 
B(n,k) = ( F m x3 , L m _i xl , . . . , Ffc x5 , Lfe_i xl , Ffe_i xl °, Lfc_ 2 x2 , • • • , 

Then y(n, 1) = ^ and ^(n, m) = Moreover, ^4(n, A; + 1) is obtained from A(n, k) 
by replacing its first seven entries by the single entry + = /| ( - n _ fc )_ 1 + 2 + F^; 

and B(n,k+ 1) is obtained from B(n,k) by inserting ( Ffc x5 , Lfc_i ) . Proposition 3.3.6 
will follow if we prove: 

Lemma 3.3.7. For 1 < k < m — 1, V(n, k) reduces to V(n, k + 1) by 5 Cremona 
moves. 

We prove Lemma 3.3.7 in five steps. Throughout, we abbreviate n — k to k! . Thus 
k! > k. 

Remark 3.3.8. (i) Each step of the reduction involves many small calculations that 
can be done directly using identities such as (3.4), (3.5) and (3.6). However, in all 
cases the required identity is quadratic in the sense of Proposition 3.2.3. Hence they 
can be all proved by verifying them for just three low values of s, as we have already 
illustrated in the proof of Lemma 3.3.5. The only condition on the choice of s is that 
all subscripts of the ft should be > 0. 

(ii) In each step of the reduction process there is no interaction between the terms 
in k and those in k'; we simplify each set of terms separately. O 

Step 1: There is a Cremona move that takes V(n, k) to 

Vi(n,k) := (/4fc'+2 - Fk-i; u fe _i x3 , v k , /W+ijW - Hk-i, 

(/2fc'+l/2fc' - -Hfc-1 - Fk-l)* 2 , Uk X5 , ■ ■ ■)■ 
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Proof. The first component of the Cremona transform of V (n, k) is 2 fik-i) ~ 

3«n, and we must show that it equals fw+i- Since u k -\ := + 2H k _i, we need 

to see that 

2 /|(fe'+l) ~ 3 /ffc'+l ~ iW+2 = 6i?fc_i - 2/ 2fc _ 1 = 2 (/2fc-2/2fc - /|fe-l) • 

But both sides are equal to —2. This is clear for the RHS by (3.4). For the LHS, note 
that 2/^ +2 — 3/| +1 — /2S+2 = — 2(— l) s , since this is true for s = —1,0, 1 and hence for 
all s > —1 by Proposition 3.2.3. 

The second three terms of the Cremona transform of V(n, k) equal + fik-i) " 

2uk-i, and one can check as above that this is /W+i^fc' — Hk-i- Therefore the given 
vector Fi(n, fc) is a reordering of the Cremona transform of V(n, k). □ 

Step 2: There is a Cremona move that takes V"i(n, k) to 

V 2 (n, k) := (jw + hk'+ihw — H k — v k , hk'+xhk 1 ~ -Hfc-i, 

{hk'+lf2k' - Hk-1 - Fk-l)* 2 , (/2fc'/2fc'-l - /2fc/2fc-l) X3 , ^fc X5 ; • • 

Proof. For the first component we need to see that 

2 (/4fc'+2 - ^fc-l) — 3life„i = / 4 fc/ + /2fe'+l/2fe' — H k — F k _i. 

But this is equivalent to the identity 

(3.16) 2/ 4fc / +2 - 3/| fc / +1 — / 4 fc/ — /W+iiW = 6 i^fc—i + .Ffc-i - 

and one can check that both sides here equal —1. 

The Cremona transform also contains three terms of the form / 4 fc'+2 — ^fc-i — 2 
and we need to check that this is iW/W-i — /2fe/2fc-i- But this holds because 

/4fe'+2 - 2/|fc'+i — hk'hk'-l = \f2k-2f2k — f2khk-l + \fik-i = ~ 1- 

Thus V^iV, A;) is a reordering of the Cremona transform of Vi (n, fc). □ 
Step 3: There is a Cremona move that takes ¥2(71, k) to 

Vjs(n, A;) := ( / 4 fc' — i*fc-i; iW+i/W — H k _i~ F k _i, (iW/W-i ~ f2kf2k-i) x4 , Wfc X5 5 • • • ) 
where the multiplicities of F k and L k -i are each increased by one to F k x6 , L k -i x2 ■ 
Proof. Since = i^jt-i + the first term 

2 (/ 4 fc' + f2k'+ihk> — H k — F k _i) -v k — 2 {f2k'+ihk> ~ #fc-i) + ^fc-i 
of the Cremona transform of V^n, fc) is equal to 

2 /W - 2-Ffe — - (/ 4fe / - 2 Ffc) = / 4fe / - F k _ x . 

Its second term 

(/4fc' + hk'+if2k' — H k — F k _i) - 2 {hk'+ifik' - Hk-i) + F k _i 
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simplifies to 

iW - jW+i/W - F k + -fffe-i = fik'fik'-\ - /2fc/2fe-i, 

where the last equality follows from the identities 

(3.17) f\k' — hw+xhk 1 — f2k'hk>-\ = 0, F k — Hk-i - fokhk-\ = 0. 
The third term of the Cremona transform of V^n, k) is 

(fik' + f2k'+\hk> — Hk — Fk~\) — V k — {hk'+\hk' ~ ~ -Ffc-l) ■ 

This is equal to F k . In the same way, we find that its fourth term is Fk — F k ^\ = L k -\. 
The result follows immediately. □ 

Step 4: There is a Cremona move that takes V%{n,k) to 

V4(n, k) := ( /W+iiW - H k _i - F k -\ + 2F k ; /| fc , + + F k , 

(f2k'f2k'-i - f2kf2k-i) x2 , Uk x5 , ■ ■ • ) 
where the multiplicities of F k and L k ^\ are now F k xS , L k ^\ x2 . 

Proof. By (3.17), the first term of the Cremona transform of V%(n, k) is 

f2k'+\f2k' - F^k-i - -Ffc-i + 2-Ffe. 
We claim that its second term is 

fik' - F k ^i - 2 (/2fe'/2fc'-l - f2kf2k-l) = fik' + iffc-i + Fk- 
This follows from 

(3.18) f 4k > - 2 f2k'hk'-i - fik' = 0) F k _i - 2 fikf2k-\ + fik-i + F k = 0. 
Finally, the third and fourth term of the Cremona transform of V^(n, k) are 

f\k' - -Ffc-l — (/2fc'+l/2fe' — F[ k _i - Ffc_i) — (/2fc'/2fc'-l - f'2kf2k-l) = F k 

where the equality holds by (3.17). Hence V^{n : k) is a reordering of the Cremona 
transform, as claimed. □ 

Step 5: There is a Cremona move that takes V^n, k) to 

(3.19) V(n, k + 1) := (/| fe , + /| fe+1 ; u k + F k , F k x2 , u k x5 , . . . ) 
where the multiplicities of F k and L k -\ are F k xl °, L k -i x2 . 

Proof. The above expression for the first term of the Cremona transform of T4(n, k) 
follows from the identities 

2 f2k'+lf2k' - 2f2k' - 2 /2fe'/2fe'-l = 0, 

2 -Hfc-i + 2 - 3 F k + - 2 f2kf2k-\ + /ffc+i = °- 

(The second identity can be simplified by subtracting the second identity of (3.18).) 
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We next claim that the second term of this transform is Uk + Fk- Since Uk = 
fik'-i + ^ Hk and H^-i + Fk = Hk, this is equivalent to the identity 

hk'+lhk' - ^hk'hk'-l - hk'-\ = 3i7jfc_i + F k -i + F k - 2/ 2 fc/2jfc-i. 

But both sides equal —1. 

Finally, its third and fourth terms are Fk because 

Here both sides vanish. □ 

This completes the proof of Lemma 3.3.7 and hence of Proposition 3.3.6. The next 
stage of the reduction process results in a vector whose components are linear 
(rather than quadratic) functions of the fk and do not involve the index k' . 

Proposition 3.3.9. When n = 2m > 6, the vector may be reduced by four Cremona 
moves to 

\r2 I 77 Xl. t Xl TP Xll r X2 771 X 10 7- X2 77 Xl3\ 

V n — \*m i L m-l ,^m-l , L "m-2 ,---,^2 , -^1 )■ 

Proof. Note that is obtained from B\ by removing two copies of F m and adding 
one F m _i. 

We first claim that the Cremona transform of is 

f 4m- 1 ~\~ F m —i, fim—1 1 U m —i 

which we reorder as 

(3.20) (/ 4m+2 Fm— 1 j "m- 1 X3 > f4m-l + F m -i, /4m-l X2 , B^) ■ 
Here one obtains the first term of the transform from the identity 

2 (/f(m+l) + /Im-l) ~~ 3ll m _i = /4 m+ 2, 

and the second term from 

fim+2 + fim-l ~ 2«m-l = /4m- 1 + F m -\. 

Next, observe that Lemma 3.3.5 (iv) implies that 2u m _i = 6F m + 2L m _i, which, as 
one can easily check, is just f^m+2 — F m -\. Therefore, the second Cremona transform 
moves the vector (3.20) to 

(3.21) (n m _i; f^m-x + F m -i, /4m-i x2 , B^)- 

We next claim that u m -\ — 2/4 m _i = F m -\. Hence the third Cremona transform 
moves the vector (3.21) to 

(/4m-i + F m -i; F m -i, x2 , B„), 

which we reorder as 

/np . TP X3 T Xl TTi Xll 7- X2 77 X 10 r X2 771 X 10 r X 2 771 X 13 \ 

{4r m ,r m , i^ra-\ , J^m-2 , ^m-2 , ^m-2 , ■ ■ ■ , ^2 , -^1 , ^1 )■ 

Therefore, another Cremona move takes the above vector to 

T/2 ( TP ■ T Xl TTi Xll j X2 TP Xl0 r X2 77 X 10 t X 2 77 Xl3\ 

V n — \ p m, , ^m-l , ^m-2 , ^rn~2 , ^rn~2 ^2 , -^1 , ^1 )■ 
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This completes the proof of Proposition 3.3.9. 



□ 



Proposition 3.3.10. For n = 2m > 6, the vector may be reduced to (2;l x5 ) by 
Cremona moves. 

Proof. One shows by direct calculation that this holds when n = 6. Therefore, by 
induction it suffices to show that the vector V% is reduced to V%_ 2 by 6 Cremona 
moves. By using the identities in Lemma 3.2.5, it is not hard to prove directly that 
this reduction may be achieved by six standard Cremona moves. Alternatively, one 
can check numerically that this holds when n = 8 and 10, checking also that the 
reordering required is the same in both cases at each stage. Then it holds for all n by 
Proposition 3.2.3. (Note that we only need to check two values since all identities are 
homogeneous and linear.) □ 

3.3.2. Reducing E{a n ) for odd n. Throughout this section we denote n = 2m+l, where 
m > 1. By Example 3.3.2 it suffices to consider the case n > 5. Hence this case of 
Theorem 3.1.1 follows from Propositions 3.3.13, 3.3.14 and 3.3.15. 
We consider the numbers 



as before. Again we have v m > > v m+ \ but now u m = L m . Therefore Lemma 3.3.5 
takes the following form. 

Lemma 3.3.11. If n = 2m + 1, the following identities hold. 

(i) fin+i = 6n + -ui and vi < u ; 

(ii) u k = v k+1 + u k+1 and u k+1 < v k+1 for k = 0, . . . , m - I; 
(hi) v k = 5 u k + v k+1 and v k+1 < u k for k = 1, . . . , m - I; 
(iv) v m = 3 u m + F m = 3 L m + F m and F m <C L m . 

Proof. The proofs of the equalities in (i), (ii) and (iii) go through as before, since these 
are based on equalities that do not mention m. (Note that the proof of the equality 
in (ii) works when k = m — 1, though the corresponding inequality failed for even n.) 
One then checks (iv). Then the inequality in (ii) follows from the equalities in (iii) and 
(iv), while the inequality in (iii) holds by (ii). □ 

As before, this lemma immediately gives the following result. 

Proposition 3.3.12. If n = 2m + 1 is odd, the continued fraction expansion of a n = 




2n-2fe-l 



+ 2H k , k = 0,...,m, v k = / 4 ( n _ fc ) - 2F k , k = 1, . . . 



m 




A, 



■n 




B, 



n 




xl 



r x5 t xl\ 
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The proof of Proposition 3.3.6 also goes through. In other words, each set of five 
Cremona moves takes one of the m — 1 pairs {1,5} from the left to the right of the 
central blocks (which now have multiplicities 1,3), while introducing no new weights 
on the right. Since we start with m blocks on the right but only (m — 1) on the left, 
this means that one pair {5, 1} still remains on the right, though all the others become 
{10, 2}. The only other difference is in the interpretation of the first term of V(n, m): 
when n = 2m + 1 and k = m we have 2(n — k + 1) = 2(m + 2). Thus we obtain: 

Proposition 3.3.13. For n = 2m + 1 > 5, the vector V n = {g n g n+ \;W (a n ),l) is 
reduced by 5(m — 1) Cremona moves to the vector V\ = ^/f m +4 + fim-i'i ^n>-^n)> 
where 

A\ := ((n m _i + F m _i) xl , u m _! x5 , w m xl ), 

Si / T x3 77 x5 T xl Z7 XlO 77 X 10 J X 2 77 X 10 J X2 iXl\ 

-°n ■— \ L m , -fm j j , ■ ■ ■ , ^2 , ^1 , ^1 ,^0 j 1 )■ 

Notice that the entries in still depend explicitly both on k and on k! = n — k 
since u m -\ and f m -i have this structure. This means that there are entries in that 
do not occur anywhere in the reduction of V n+ 2- The next stage takes us to a vector 
that occurs in the reduction of all V n+ 2i- For even n, this stage consisted of four moves, 
but now it takes six moves. 

Proposition 3.3.14. When n = 2m + 1 > 5, the vector is reduced by six Cremona 
moves to 

T>2 It __ 77 . IT _OZ7 N lXlpX7r X2 p X 10 r X2 \ 

' n • — v m r mi\- L 'm ^ r m) i r vn ;^m— 1 j r m— 1 j ^m— 2 i ■ ■ ■ ) ■> 

where the terms including and after F m _i xl ° in V% are the same as those in B\. 

Proof. Since the proof is much the same as that of Proposition 3.3.9, we simply list the 
results of each Cremona move. Here are the results of the first four moves: 

( fim+6 — Fm— l't^m— 1 iV m , 

fAm+3 + fim-1 + F~m-li (/4m+3 + fim-l)* 2 , L m X3 , . . . ) 
( fim+b + L m -i;v m , /4m+3 + /4m- 1 + Fm-1, (/4m+3 + /4m-l) X2 , L m x3 , 

/4m+l X3 , • • • ) 

( /4m+4 — Fm-1] fim+3 + /4m-l 5 L m X ^ , f&m+l* 1 F m , L m „i . . . ) 

( /4m+3 + fim-l'i fim+3 + fim-l ~ L m , L m , fim+l* 4 , F m , L m __i, . . . ) . 

None of these moves uses up any of the terms of B^ after L m x3 , though the multiplicity 
of F m , L m _ i is increased. Hence after the entry for L m we have simply listed the extra 
terms that get added to B\. Note also there is just one new number that appears after 
L m , namely /4 m +i = L m — F m which appears with multiplicity 4. Using the same 
conventions, the next move gives: 

( 2/4m+i; /4m+l X3 > fim+l ~ F m , F m X2 , L m _i, . . . ) . 

The last move changes the first four terms to the single term f4 m +i = L m — F m . □ 
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Proposition 3.3.15. For n = 2m + 1 > 5, the vector V 2 may be reduced to (2; l x5 ) 
by Cremona moves. 

Proof. This is just the same as the proof of Proposition 3.3.10. □ 

4. The interval [t 4 ,7] 

This section is devoted to the calculation of c(a) on the interval [r 4 , 7], thus complet- 
ing the proof of part (ii) of Theorem 1.1.2. Proposition 1.2.9 gives an easy argument 
that c(a) = ^jjp when a G [6j^,7|]. To prove that this holds on the whole interval 
[r 4 , 7], we shall adapt the arithmetic approach that works for a < r 4 rather than using 
more analytical arguments as in the case a > 7. We show in Proposition 4.1.6 that 
it suffices to restrict attention to some special points with relatively short continued 
fraction expansions that are related to the convergents of r 4 , and then deal with these 
special points by largely arithmetic means. The proof that c(a) = ^jp on [r 4 ,7] is 
given at the end of §4.1. 



4.1. Reduction to special points. As in Section 2.1, given a with weight expansion 
w(a) and (d;m) G £, we define e by m = -^w(a) + e, and denote E := Yl £ 1- Also 
set A 2 := 1 - E. Denote 

y{a) := a + 1 — 3\fa. 

Since y(r 4 ) = we have y(a) > for a > r 4 . The first result extends Proposition 2.1.1. 



Proposition 4.1.1. Let a G (r 4 , 7) and suppose that (d; m) G £ is such that fj,(d; m)(a) > 
3^/a 



Then 



(i) d< 



Va 2 - 7a + 1 ' 



.... , 9 2d 2 y(a) 

Proof. By Proposition 2.1.1 (i) we have < y/ayfl + 1/d 2 , proving (i). 
Since 9 ^ < m ■ w/d = i/a + e ■ w/d, we have y(a)d/3 < e ■ w. Further, 

d 2 + I = m ■ m = —a + ^w(a) ■ e + e ■ e, 

i.e., e ■ w(a) = proving (ii). □ 

The continued fraction expansion of r 4 = I±|^ is [6; 1, 5, 1, 5, ... ]. For k > 1 define 
its /sth convergent by 

cafc.! := [eiO.^M,!] - [6;{l,5}><( fe - 2 ),l,6], 
c 2fc := [6;{l,5} xfc ]. 
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Thus 

(4.1) Cl = 7, c 2 = 6| = f, c 3 = [6;l,5,l] = [6;l,6]=6f = f , 
c 4 = [6; 1, 5, 1, 5] =6f = ^, c 5 = [6; 1, 5, 1, 6] = 6§ = 

and more generally 

C2k < c 2 k+2 < r 4 < c 2k+1 < c 2 fe_i for all fc > 1. 
Moreover, for > 1 and j > 1 define the numbers u k (j),v k (j) £ (c2k+i, c 2k-i) by 

«fcO') == ^{l^}^- 1 ),!^], 
v k (j) := [^{l^-^^j]. 

As in Definition 3.2.4, we define for k > 1 the fcth Lucas number = + fk+i, and 
set -Ffc = \fik and Lfc = \£&k+2- Recall from Lemma 3.2.5 that for all k > 0, 

(4.2) F fc+ i = L fc + F fc and 4+1 = 5^+1+^ 
as well as 

(4-3) F fe 2 +1 -F fc F fe+2 = 1. 

Lemma 4.1.2. For all k > 1, 

/ ' \ Fk+i Lk+\ 

V) C2k-i = t-> ; c 2 fc — — — ; 

(11) u k (j) = for all j > 1; 

/...x , -x Lk+jFk+i {j - 6)F k+1 + F k+ 2 , u . . 

(111 Vk(j) = — — = ~r~. g\ 771 1 771 f° r al1 3 > 1 - 

L k -i + jF k (J - 6)F k + F k+1 

Proof. Recall that if p n /q n is the nth convergent to [£q; £±, . . . , £jy] then, for any n < N 
and any positive xelwe have 

I a a\ \i> n n 1 Pn-2 + %Pn— 1 

(4.4) 4;4,...,4_i,x = ■ . 

q n - 2 + xg„_i 

(i) follows from induction on k: The statement is true for k = 1. Assume it holds 
for A;. By (4.4) with ar = 1, and by (4.2), 

Fk+i + _ F k+ 2 

Fk + Lk Fk+i 
Then, by (4.4) with x = 5, and by (4.2), 

_ £fc+i + 5i ? fc+2 _ L k+ 2 
Lk + 5F k+ i L k+1 

(ii) follows from (i) by using C2 k +i = [6; {1, 5} x( - k ~ 1 \ 1, 6]. 

(iii) follows from (i) and (4.2). □ 



48 DUSA MCDUFF AND FELIX SCHLENK 

Corollary 4.1.3. For all k > 1, s > 3 and t > 8 we have 

c 2 fe+i < u k (s + 1) < u k (s) <■■■ < u k (2) < u fc (l) = v k {7) < v k (t) < v k (t + 1) < c 2 fc_i. 
Proof. This follows from Lemma 4.1.2 and identity (4.3). □ 

The following corollary will be very useful. 
Corollary 4.1.4. (i) Let u = u k (j) =: | ; where j > 1. TTien 

g 2 (u 2 -7u + l) = j 2 + 7j + l. 
(nj Let v = v k (j) =: |, where j > 1. T/ien 

g 2 (V - 7v + 1) = j 2 - 5j - 5. 

Proof. The proofs of (i) and (ii) are similar. We prove (ii). In view of Lemma 4.1.2 (iii) 
we need to show 

(4.5) (L k + jF k+1 ) 2 - 7 (L fc _x + jF k ) (L k + jF k+1 ) + (L fe _x + jF k ) 2 = j 2 - 5j - 5. 

Fix j. Identity (4.5) is true for k = 1,2,3. It therefore holds for all k by Proposi- 
tion 3.2.3. □ 

Definition 4.1.5. We say that a point a £ [t 4 ,7] is regular if for all (d;m) £ £ with 
£(m) = l{a) we have fi(d;m)(a) < 

Proposition 4.1.6. Assume that all the points c-i k -\ and all the points 

u k (j) with k > 1 and j > 2 and v k (j) with k > 1 and j > 7 
are regular. Then c(a) = pjp on [r 4 ,7]. 

A main ingredient in the proof will be the following 
Lemma 4.1.7. Consider the functions ip(a) := pp and ip{a) := yfa. Fix k > 1. T/ien 

(i) ¥>(wfc(j + 1)) > ip(u k (j)) for all j > 1; 

(ii) ip(v k (j)) > tp(v k (j + 1)) /or a// j > 7. 

Proof, (i) Abbreviate ix := tifc(j + 1), u' = u k (j). We need to show that pp- > Vuf, 
i.e., 

(4.6) m 2 + 2u + 1 > 9u'. 
Recall from Lemma 4.1.2 (iii) that 

(47) U + l ) F k+2 + Fk+i , = jF k+2 + F k+1 

In particular, the denominator of u is q := {j +l)F k+ \ + F k . Applying Corollary 4.1.4 (i) 
to u we therefore find 

„ 2 = ( J+ i) 2 + y + i ) + i +7M _ 1 
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and so (4.6) is equivalent to 

(4.8) 9{u' - u)q 2 < (j + l) 2 + 7(j + 1) + 1. 
Using (4.7) and F% +1 — F k F k+ 2 = 1 we compute 

u >_ u= 1 

((j + l)F k+1 + F k ) (jF k+1 + F k ) ' 

Inequality (4.8) therefore becomes 

(4.9) 9 ^ + l )Fk+ ^ Fk < (J + I) 2 + 7(J + !) + !. 

J*k+1 + r k 

For all j > 1 the second factor on the LHS is < 2 and the RHS is > 19, and so (4.9) 
holds for all j > 1. 

The proof of (ii) is similar (but slightly easier). □ 



Proof of Proposition J^.1.6: Assume that c(a) < ^jjp does not hold on [t 4 ,7]. Since 
c(a) > > y/a on (r 4 , 7], Corollary 2.1.4 shows that c(a) is piecewise linear on (r 4 , 7]. 
Let 5c(r 4 , 7) be the set of non-smooth points of c on (r 4 , 7). This set decomposes as 
S = S+ U S-, where S+ (resp. S-) consists of those s 6 5 near which c is convex (resp. 
concave). Note that for s G S+ we have c(s) > By Proposition 1.2.9, c(a) = 

for a G [6j^, 7], and so the biggest point of S is in S-. This and c(r 4 ) = r ^ imply that 
the set S + is non-empty. Let ao = maxS' + . Then ao G (r 4 ,7). By Corollary 2.1.4 (i) 
there exists (d; m) G £ and e > such that 

(4.10) c(z) = n(d;m)(z) on [ao,ao + e]. 

Abbreviate /x(^) := n{d;m){z). By (4.10), /x(ao) = c(a ) > > y/ao. Let / 

be the maximal open interval containing ao such that fi(z) > \fz for all z £ I. By 
Lemma 2.1.3, there exists a unique a' £ I with £(m) = £(a'), and £(m) < £(z) for all 
other z £ I. Further, by Proposition 2.3.2, the constraint fj,(z) is given by two linear 
functions on /: 




a + f3z if z < a', z G /, 
a' + P'z if z > a', z G /, 



that is, a' is the only non-smooth point of [i on /. By (4.10), and since ao G S+ and 
fi < c, the point ao G / is also a non-smooth point of fi, and hence a' = oq. Now (4.10) 
and the fact that c is nondecreasing show that (3' > 0. 

Let k > 1 be such that ao G [o2 k +i,C2 k -i]- Since C2fc+i and C2 k -i are regular by 
assumption, we have ao G (c2 k +ii c 2k-i)- Note that u k {j) — > C2 k +i and v k (j) — > oi k -\ 
as j —7- oo. Let u-,u+ be the two neighboring points from the sequence 

• • • < u k (s + 1) < u k (s) < ■ ■ ■ < u k {2) < u k (l) = v k (7) < v k (t) < v k (t + 1) < . . . 

from Corollary 4.1.3 (ii) with ao £ [ u -> u +]- Since n_ and u + are regular by assump- 
tion, we have ao G (u-,u + ). Then /i(ao) > > "~ 3 +1 = p{u-) > ip{u + ) = y/u+ by 
Lemma 4.1.7. Since /3' > 0, it follows that fi(u+) > //(ao) > y/u+, and hence u + G /, 



50 DUSA MCDUFF AND FELIX SCHLENK 

and hence £(u + ) > £(ao). However, £(z) > £(u-) and £{z) > £(u+) for all z G (u-,u + ); 
in particular, £{clq) > £(u+), a contradiction. □ 

As we see in the next two lemmas, one can prove that most of the points Uk(j) and 
Vk(j) are regular by direct arguments. 

Lemma 4.1.8. The points Uk(j) with k > 1 and j > 2 are regular. 

Proof. Abbreviate u := Uk(j) and | := u in lowest terms. Assume that (d;m) G 

£ is such that fi(d;m)(u) > and £{u) = £(m). By Proposition 4.1.1 (i) and 
Corollary 4.1.4 (i) we can estimate 

d 3 3 

< 



Qy/u qyju 2 -7u + l \/i 2 + 7j + 1 ' 

an estimate independent of fc. Note that , 3 < 1 for j > 2. Since ^(u) = £(m), 

Vj' 2 +7j+l 

we have m; > 1 for all i. Therefore, 

d^ 2 



£ > j 1 = > j 1 =: s(j). 

The function s(j) is increasing in j, and s(3) > 1, proving the lemma for j > 3 and all 
fe > 1. 

Assume now that j = 2. In this case, s(j) < 1. We therefore need to use the better 
estimate E = 1 — X 2 < I — 2 g^' > from Proposition 4.1.1 (ii). With this estimate we 
have 

n , 2 ( d V 2d 2 y(u) 
= E + X 2 -l > 2 1 + - "l_ ' - 1 



g^/u/ 3 
2 2y(«)\ / 4\ 



., + W + — r r +1 =: 

We need to show that f(d) > 0. Since / is a quadratic polynomial in d, this holds if 
its discriminant is negative, 

ii < j( 2 | 



Multiplying by | and using = u + 1 — 2>y/u, this is equivalent to 

1 u + l / _ 

<7 Z 3 

Taking squares, replacing u by |, and multiplying by 9g 4 , this becomes 
(4.11) 0< -9 + p 3 g-7p 2 g 2 +pg(-18 + ^ 2 ) . 

Recall now from Lemma 4.1.2 (ii) that p = 2Fk +2 + ^fe+i and q = 2Ffc +1 + F^. Since 
pq > 9 for all A; > 1, (4.11) follows from the identities 

1 = p2_ 7pg+ (_ lg + ? 2) ) 
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Lemma 4.1.9. The points Vk(j) with k > 1 and j > 8 are regular. 

Proof. Abbreviate v := Vk(j) and | := v in lowest terms. Assume that (d;m) G £ 

is such that fj,(d;m)(v) > 2 ^ and £(m) = Again, by Proposition 4.1.1 (i) and 

Corollary 4.1.4 (ii), 

d 



< 



qVv q^v 2 -7v + l ^/j 2 - 5j - 5 ' 
)te th; 

mi > 1 for all z. Therefore, 



independent of k. Note that , 3 < 1 for j > 8. Since £(v) = £(m), we have 



d - 2 



E > 3 1 7= > 3 1 ; == <C?')- 

The function is increasing on {j > 8}, and t(8) > 1, whence the lemma follows. □ 

For k > 1 and i > we set 
(4.12) 6 fc (i) := « fc (l + 3i) = [6;{l,5} x < fc - 1 > J l,l + 3»]. 

Hence 6 fc (2) = u fc (7), 6 fc (3) = « fc (10), .... 

Lemma 4.1.10. c(a) = /or aZZ a = 6^(i), fc > 1, i > 2. In particular, the points 
Vk(7) are regular for all k > 1. 

The proof is postponed to Corollary 4.2.4 in the next subsection. It uses the existence 
of special (nearly perfect) elements of £, rather than the estimates of Proposition 4.1.1. 

Proof of Theorem 1.1.2 part (ii). By Proposition 4.1.6 it suffices to show that for 
all k > 1 the points C2k-i, Uk(j), j > 2, and Vk(j), j > 7, are regular. Regularity holds 
for Uf~{j), j > 2, by Lemma 4.1.8 and for Vk(j), j > 8, by Lemma 4.1.9. Moreover, 
when a belongs to the subsequence 6fc(i), i > 2, of the ffc(j'), then Lemma 4.1.10 makes 
the stronger statement that c(a) = ^jp. This holds in particular when a = v^{7) = 
6^(2). Hence these points are regular. Further, because the sequence ->2 con_ 

verges to C2k-i as i — > oo, the continuity of c implies that c(a) = also at a = C2k-i- 
Hence these points are also regular, which completes the proof. □ 



4.2. The classes E(pk(i)). Recall from Section 3.1 that for n > the points b n = 
< r 4 are ^ ne break points of the Fibonacci stairs. The next result shows their 
relation to the numbers bk(i), k>l,i>0, defined by (4.12). 

Lemma 4.2.1. &2fc = 6^(0) and &2fc+i = Ofc(l) for all k > 1. 
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Proof. Using Proposition 3.2.3 and Definition 3.2.4 we see that L k + F k+ \ = g2k+2 and 
L k + 4Ffe + i = <72fc+3 for all k > 0. Together with Lemma 4.1.2 (iii) we conclude that 

, / nA / -i \ L k + F k+ i §2k+2 , 
h{0) = v k (l) = — - = = b 2k , 

L>k-\ + ^k 92k 

MD = v k (4) = Lk+4Fk +J = f** = W 
^k-l + 92k+l 

as required. □ 

The lemma says that the sequence (6 n ) , n > 2, extends to a double sequence (bk(i)) , 
k > 1, i > 0, where for each A: > 1 the sequence (&fc(i)), « > 0, emanates from the pair 
(&2fc,Wi) = (6fc(0),6 fc (l)). 

Recall from Section 3.1 that £?(6 n ) := (g n +i', W(b n )) , where W(b n ) := g n w(b n ). 
In order to prove Lemma 4.1.10, we associate classes E(b k (i)) to all b k {i) as follows. 
Let b k (i) =: -. Let m k (i) be the tuple obtained from qw(b k (i)) by replacing its 
last block (l x ( 1+3i )) by (i, l x ( 1+2i )) , and set d k (i) := q(l + b k {i))/3. Then define 
E (h(i)) : = (d k (i);m k (i)). 

Note that for i = 0, 1 we have (with n = 2k + i) that m k {i) = g n w{b k (i)) = W(b n ) 
and, by (3.2), 4(0 = 5n(l + )/3 = (<7n + g n +2) = g n +i- Therefore, 

(4.13) E(b k (0)) = E(b 2k ), E(b k (l)) = E(b 2 k+i) for all k > 1. 

Proposition 4.2.2. E{b k (i)) G £ /or all k > 1 and i > 0. 

Before proving Proposition 4.2.2, we show that it is the key to completing the cal- 
culation of c on the interval [1, 7]. 

Corollary 4.2.3. Part (i) of Theorem 3.1.1 holds. 

Proof. This is immediate from equation (4.13). □ 
Corollary 4.2.4. Lemma 4- 1-10 holds. 

Proof. We argue as in the proof of Corollary 3.1.2. Let (d;m) G 6. Write b' := b k (i), 
d' := d k (i), ml := m k (i), so that (d';m') = E(b k {i)). If (d;m) = (<f;m') ; then 

/ j n /,a m'-w(b') 3b' r- 

because b' > r 4 . If (d; m) / (<f; ml), then dd' > m ■ m' by positivity of intersections. 
By the definition of d' and ml , and since m is ordered, this spells out to 

qd^-^- > m m! > qm • w(b'), 
i.e,^>k-«)(6')=:^;m)(fe'). □ 
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Remark 4.2.5. (i) Note that the classes E(b k {i)) = (dfc(i); mfc(«)) are perfect for 
i = 0, 1, but are not perfect for % > 2. Since, nevertheless, at bk(i) =: | we have 

(4.14) m k (i) ■ w(b k (i)) = qw{b k (i)) ■ w(b k (i)) = qb k (i) for all i, 
these classes are useful also for i > 2. 

(ii) For i > 3 there are other choices for m k {i) that can be used to prove Lemma 4.1.10. 
All one needs is that (4.14) holds. For this, one just needs to alter the last block 1 XJ 
so that the sum of entries stays intact, while the sum of the squares goes up by i 2 — i. 

If i = 2, one has no choice: In order to make the sum of squares go up by 2 one 
must replace l x7 by 2, l x5 . If i = 3, however, instead of replacing l xl ° by 3, l x7 , one 
can replace it by 2 3 , l x4 . The resulting class (c/fc(3); m' fc (3)) lies in £ for all k. If i = 4, 
instead of replacing l xl3 by 4, l x9 , one can make the sum of squares go up by 12 also 
by replacing mfc(4) = (. . . , l xl3 ) by one of 

m«(4) := (...,3 X2 ,1 X7 ), mf(4) := (. . . , 3, 2 x3 , l x4 ) , m< 8 >(4) := (...,2 x6 ,l). 

However, while the classes (c4(4); mj^(4)) and (dfc(4); mj^(4)) reduce to (1; 1, 1), and 
thus lie in £, the classes (dfc(4);mj^(4)) do not reduce correctly. O 

We now turn to the proof of Proposition 4.2.2. It follows from Lemma 4.2.6 and 
Proposition 4.2.7 below. 

Lemma 4.2.6. The class (d k (i); m k {i)) satisfies the Diophantine conditions (1.8) for 
the elements of £. 

Proof. Write b := b k (i) and (d;m) = [d k {i); m k (i)) . By Lemma 1.2.6, J2 m s = 
^qwsib) = q(b + 1) - 1 = 3d - 1. Further, ^Tto 2 . = q 2 b + i 2 - i, and by Corol- 
lary 4.1.4 (ii), 

q 2 (b 2 -76+1) = (1 + 3i) 2 - 5 (1 + 3i) - 5 = 9i 2 - 9i - 9. 

Therefore, 

d 2 + 1 = (1 + 6) 2 + 1 = |g 2 (6 2 - 7b + 1) + g 2 6 + 1 

= i 2 -i + q 2 b = ^m 2 , 

as required. □ 
Proposition 4.2.7. The classes E{b k {i)) reduce to (1; 1, 1) for all k> 1 and i > 0. 
Proof. We fix i > 0, and argue by induction on fc > 1. 

Step 1. Assume that k = 1. Set j = l+3i The weight expansion of = [6; 1, j ] =: 

(4.15) }((; + i) x6 ,jM xi )- 
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Together with (1.7) we fiiiddi(i) = §(l + 6i(i)) = 5 (6(j + 1) +j + j + 1) = 3 (8j + 7) = 
8i + 5, and so 

(di(i);mi(i)) = (8i + 5;(3i + 2) x6 ,3i + l,i,l x ( 2i+1 )). 
Applying five standard Cremona moves yields, successively, 

(7t + 4; (3i + 2) x3 , 3i + 1, (2i + l) x3 , i, l x ( 2i+1 )) ; 

(5i + 2; 3i + 1, (2i + l) x3 ,i x4 , l x ( 2l+1 )); 

(3i + l;2i + l,i x5 ,l x ( 2i+1 >); 
(2i + l;i + l,i x3 ,l x ( 2i+1 )); 

(z + l;i,l x ^+ 2 )). 

The standard Cremona move maps (s + 1; s, l xt ) to (s; s — 1, l x (*~ 2 )) for any s > 1 and 
f > 2. Applying i more standard Cremona moves therefore moves + l x ( 2l + 2 )) to 

Step 2. Assume by induction that {d k (i);m k (i)) reduces to (1; 1, 1). We shall show 
that (dfc + i(i);mfc + i(«)) reduces to (dfc(i)i m fc(*)) by five standard Cremona moves. 
The end of the weight expansion qw(b k {i)) = qw(vk{j)) is 

(..., (48j + 41) x5 , 41? +35, (7j + 6) x5 , 6j + 5, (j + l) x5 , j, l xj ). 

Using = L k + F k and L fc+i = 5F fc+i + L k from (4.2), and 

Lfc+i - L k = 5L k + (Lfc - Lfc-i), 

which follows from these two formulae, we see that in general, 

(4.16) m fc (i) = ( (jF k + L fc _!) x6 , (j + l)L fc _i - L fc _ 2 , 

C/F fc _i + L fc _ 2 ) x5 , (j + l)L fe _ 2 " £ fc _ 3 , • • • , 

(jF 2 +ii) x5 , (j + l)Li-L , (j + l) x5 , j, i, l^'"*)). 

It will be convenient to express the numbers djfc(i) in terms of the even Fibonacci 
numbers h k := f 2 k- Thus 

(4.17) hi = 1, h 2 = 3, h 3 = 8, h 4 = 21, /i 5 = 55, /i 6 = 144, . . . 
Lemma 4.2.8. d k {i) = h 2k+2 + — 2)h 2k+ i 

Proof. We fix % and j = 1 + 3i, and write <4 = dfc(i), m k = m k (i), b k = b k (i), etc. Set 
b k = ^. By Lemma 4.1.2 (iii), 

Pk = F k+2 + (j - 6)F fc+ i, q k = F k+1 + (j - 6)F k . 

Therefore, 34 = q k (b k + 1) = p k + q k = F k+2 + (j - 5)F k+l + (j - 6)F k . We thus need 
to show that 

Fk+2 + (j - 5)F k+ i + (j - 6)F k = 3h 2k+2 + 3(i - 2)/i 2fc+1 . 
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This holds true for k = 1 and k = 2, and so it holds true for all k > 1 by Proposi- 
tion 3.2.3. □ 



Lemma 4.2.9. The class (dk+i(i); rrik+i(i)) reduces to (dk(i); rrik(i)) by five Cremona 
transforms. 

Proof. Fix i. By (4.16) and Lemma 4.2.8, the entries of (dk+i(i); mk + i(i)) are given 
by linear formulas in Fibonacci numbers, that depend only on k. Using (4.16) and 
Lemma 4.2.8 one checks for k = 1 and k = 2 that ((4 + i(z); m^ +1 (i)) reduces to 
(dk{i);rrik(i)) by five Cremona transforms with equal reordering at each stage. The 
lemma thus follows from Proposition 3.2.3. □ 

The proof of Proposition 4.2.7 is complete. 

4.3. The ghost stairs. In this section we compute the contribution of the classes 
E(bk{i)) = (dk(i); rrik{i)) , i > 2, to the graph of c(a) = pjp on [r 4 , 7] . The lemma and 
the proposition below are not needed for the results of this paper, but they illuminate 
the role of these classes. 

Lemma 4.3.1. Assume that i > 3. Then fi(dk{i);mf t {i)){a) < ^fa for all a > 1 and 
all k > 1. 

Proof. Write d = dk(i), m = m^(i). Assume that n(d;m)(a) > yfa for some a > 1. 
Let / be the open interval such that [i(d; m){z) > \[z and a £ I. Let ao be the unique 
point in / with £(ao) = l(m). Recall that 

m = (...,», l x ( 2l+1 )). 

Since i > 3, the last block of w(ao) must have length 2z+l according to Lemma 2.1.7 (i). 
But 

\i - (2i + 1)| = i + 1 > v/2T+T , 
in contradiction to Lemma 2.1.8 (i). □ 

Thus, surprisingly, for i > 3 the classes E(pk{i)) give no embedding constraints, but 
nevertheless are most useful to find c(a) on [r 4 ,7]. 

We now look at the case i = 2. For k > 1 write e& := 6^(2) = w^(7) and E(ek) = 
(dk^rrik) := (<ifc(2); m,fc(2)) . Recall from Lemma 4.2.8 that = h,2k+2, where /i2fc+2 = 
/4fc+4 is an even Fibonacci number. We now show that the corresponding constraint 
functions ^(d^m^) form a staircase whose properties echo that of the Fibonacci stairs 
on the other side of r 4 . However this staircase does not add anything new to the graph 
of c(a) because it never rises above the line y = ^jp. Thus we call it the ghost stairs. 
Note also that, although E(ek) is made from w(ek) and so influences c(a) at a = e^, it 
gives a constraint that is centered at the convergent C2k+i < &k- 
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Proposition 4.3.2 (The ghost stairs). 



H(d k ;m k )(z) 



for z G [c 2k ,c 2k+1 ] 
[ IS for ze[c 2k+1 ,e k ]. 



Since for each k we have C2 k < r 4 , the proposition shows that /J,(d k ; m k ){z) = c(z) = 
^±1 on [T 4 ,c 2fc+ i]. 

Proof. Fix A:, and recall from Lemma 4.1.2 and Corollary 4.1.3 that 

(4.18) c 2k = [6; {1, 5}^), 1,5], 

(4.19) c 2k+1 = [6; {1, 5}**" 1 ), 1,6], 
(4-20) e k = [6; {1,5}^), 1,7] = 

and that c 2k < c 2k+1 < e k . If z £ (c 2k , c 2k+1 ) then z = [6; {1, 5}( fe_1 ) ,l,g,h,...] with 
g = 5 and h > 1, while if z G (c 2 fc+i, e^) then 2 = [6; {1, 5}^ fe_1 ^ , 1, g, h, . . . ] with 5 = 6 
and h > 1. In both cases, the weight expansion has the form 

(4.21) «;(*) = (l xe ,z-6,(7-z) x5 ,6z-41,...,x 2k _ 1 (z) = zL k _ 1 -L k , 

(x 2k (z) = F k+ i - zF k ) xg , (x 2k+1 (z)) xh , ...). 

Further w(e k ) begins the same way, but ends at the block of terms 

{x 2k {z) = F k+1 -zF k ) x7 = {a\ k + zf3 2k ) X7 , 

where the last expression uses the elements a? and /3| of equation (2.5) with a = e := e k . 
Since e k has N + 1 blocks where TV is even, Corollary 2.2.7 implies that 

L k +7F k 



tj Xj{e k ) (a e j + zpf) = e k 



r , k -!+7F k ■ 



Further, m k = qw(e k ) where q := L k _i + 7F k , except for the last block where we have 
2, l x5 instead of l x7 . When z has g = 6 it follows that 

m k • w(z) = m k ■ w(e k ) = qw(e k ) ■ w(e k ) = qe k = L k + 7F k+1 . 

Thus fi(d k ;m k )(z) is constant on this interval. On the other hand, if g = 5 then 
x 2k +i(z) = zL k — L k+ \ and we find 

m k -w(z) = S ^qi j x j (e k ){a e j + zfif) - x 2k (z) + x 2k+1 (z) 
3 

= qe k - F k+1 + zF k + zL k - L k+1 
= 6F fc+1 + L k - L k+ i + z(L k + F k ) 
= (l + z)F k+1 . 

But 

34 = g(l + e k ) = L fc _i + 7F k + L k + 7F k+1 = 9F k+1 . 
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Therefore n(dk;rrik)(z) = (z + l)/3 for z £ [c2k,C2k+i] and it remains to check that 
c 2 fc+i + 1 = 3h 2 k+3/h2k+2- Since c 2 k+\ = F k+2 /F k+ i and F k+1 = 3h 2k +2 this reduces 
to the identity 

/4/C+8 + f4k+4 = 3/4fc+6> 

which is readily checked using Proposition 3.2.3. □ 



Remark 4.3.3. (i) At the points v k (j) with j > 7, Theorem 1.1.2 (ii) implies that 
c ( v k(j)) = Vk ^ +1 , which by Lemma 4.1.2 (iii) can be written as 

Uk+2 + jfik+A \ _ hk + j/4fc+2 
hk-2 + jf 4k J hk-2 + jfik ' 

where l^k+2 = 3L^ as in Definition 3.2.4. In particular, at 6^(2) = v k (7) and 6^(3) = 
Wfc(lO), 

^2fc+3\ ^-2fc+2 an( j / hk+5 \ ^4fc+3 

^2fc+l/ ^2fc+l V^4fc+l/ ^4fc+l 

where the /ifc are the even Fibonacci numbers of (4.17). On the other hand, on the left 
of t 4 , where < y/a, we have by Theorem 1.1.2 that 

9n+2 \ 9n+2 + 1 



c(b n ) = c 

\ iJn J iJn 

In other words, the function c attains the value bn t l already at b n 



9n J 9n+l o 

+ 1 J__ _x L 

1- 



(ii) Recall from Section 3.1 that a n = ( 3 ^ +1 ) 2 , and that on the left of r 4 , the classes 
W'{a n ) obtained from W(a n ) = g^w(a n ) by adding one 1 were very useful to establish 
the Fibonacci stairs. One may try to define similar classes at a' n := ( fe ^ +1 ) 2 . Denote 
by W"(a' n ) the sequence obtained from h 2 n w{a' n ) by removing three of the Is at the 
end, and adding one 2. Thus when n = 3 we get 

«3 = (f) 2 , ^V 3 ) = (64 x6 ,57,7x 8 ,2,ix 4 ). 

It is easy to check that the tuple (h n h n+ i;W"(a' n )) satisfies the Diophantine equa- 
tions (1.8). However, when n > 3 this is not an element of £ because it has negative 
intersection with the class (3;2,l x6 ) G £. On the other hand, when n = 2 this gives 
(24; 9 x7 , 2, l x6 ) G £ which as we will see in Theorem 5.2.3 does give an obstruction 
near a = 7j, and when n = 1 we get (3; 2, l x6 ) itself. O 

5. The interval [7, 9] 

This section calculates c on the interval [7,9]. The main arguments are contained 
in §5.2 and §5.3. We begin in §5.1 by establishing some estimates that are most 
useful on [8, 9] but are also needed for some of the arguments concerning [7, 8] such as 
Lemma 5.2.7. 
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5.1. Preliminaries. We begin with a simple result about continued fractions. Let 
q n (a) be the denominator of the nth convergent [^o; £1, ■ ■ ■ ,£n] to the continued fraction 

o:= [£ ;£ 1 ,...,£ N ]=£ + - — j— . 

Thus qi(a) = £i,q 2 (a) = 1 + £\£ 2 and, in general, q n (a) = £ n q n -i{a) +q n ~2{a)- Then 
an easy induction argument shows that: 

Sublemma 5.1.1. Let L := J2jLi^j- Then gjv(o) > L. 

In the sequel, we abbreviate a := Yli>e £ 1 < 1 an d f' : = X^ <i<M-^v e i — <T ' - 

Lemma 5.1.2. Assume i/iai (d;m) £ £ is such that fi(d;m)(a) > yfa for some a G 
(r 4 ,9) mtt £(a) = £(m). A ssume further that y(a) > ^ where q := qN{a), and denote 

vm:=^. Then 

(i) \J2i>i^i\ < 

(ii) IfvM < 1 ^en | ^i>i £ «l ^ \/o 7 Z- 

(hi) //«m < ^, then vm > j a^d v' < \. If vm < | , ^en a' < |. 
(iv) Define 5 := y{a) - | > 0. T/ien 

d< f (v^Z-l) < f (v^-1) < f 
Further, if vm < 1, #&en a can &e replaced by a' . In particular, always 

d<% (1-1) <¥• 

Proof. Step 1: £i>i^ < 0. 

Proposition 2.1.1 (iv) states that 

(5-1) _ Ee . = i + ^( y(a )_I). 

Since we assume that y(a) > -, Step 1 is immediate. 

Step 2: Y,i>t fa\ > IEi>i^l- 

If Yli<£o £ i — 0) then by Step 1 we have 

i>l j>^ «>^0 

as required. Therefore, suppose that J2i<e £ i < 0- Let P = {z > I £ % > 0} and 
Q = {i > £q I £j < 0}. Because toj = 1 for i < ^o> we have 

i<£o i£P i&Q 
i<e «eP 
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Therefore 

> ^£j > ^£j = \Ei\ > l £ *l- 

i>l «eQ ieQ i>^ 

Step 3: Proof of (i). Let a = [^oj^ij • • • >&n] as above, and write e as JV + 1 blocks 
each of length £j . Assume first that e-i is constant on each block with absolute value 5j . 



Let 



tjdj so that |<5j| 



Then 



E 



i>io 



Hence, by Step 2, 



|E- 

i>i 



< 



i>e Q 



a. 



< 



e ^ \/z 



This proves (i) in the case when the £j are constant on the jth block for all j > 1. 
But by Lemma 2.1.7 the only other possibility is that there is precisely one block, say 
the Jth, on which e% is not constant. In that case we subdivide this block into two 
subblocks of lengths lj — 1 and 1. Since the upper bound y/aL depends only on the 
sum of the £j, the argument goes through as before. 

Step 4: Proof of (ii). We abbreviate M' = M — £n, and write V{ := ^=Wj. If the Vi 
are constant on the last block and if vm '■= < 1, then rriM'+i = • • • = itim = 1, and 

so £m'+i = ■ ■ ■ = em = 1 — vm > 0. Since also £E e % < 0) we nave Ej e «l — YliLi e 
Hence, the argument in Step 3 adapts to show that 



IE 

i>l 



Si < 



M' 

E £ < 

i=l 



< 



M 

E 

i>e 



< Vo-'L. 



Proof of (Hi). Assume that vm < If £n > 3, then 

1>^M'+1 + ---+ £ M> 3 (I) 2 > 1 ' 

a contradiction. If = 2, then vm-2 = «m-i + vm = 2-um < |, and so 



l>e M _ 2 + 2e M >{lY + 2{lY = l, 
a contradiction. Further, em > 5 implies that 

< J] ^<a-±<±. 

lo<i<M-2 
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The second claim in (iii) is proved similarly. 

Proof of (iv). We use Sublemma 5.1.1 and equation (5.1) to estimate 

(5.2) ^ > v^L > 1 + (y(a) - ±) = 1 + ^6 = 1 + 6qv M > Sqv M . 

Therefore, ^fq < ^j-, and so, using again (5.2), 

d< f (v^Z-l) < gi^rq-l) < f (^-l). 
If «m < 1, we repeat this argument with a replaced by a'. This completes the proof. □ 

5.2. The interval [7,8]. In this section we calculate c(a) on the interval [7,8]. At 
some places, we will use the computer. We will therefore first prove a weaker result 
that does not use the computer. 

Proposition 5.2.1. There are only finitely many (d;m) G £ for which there is a > 7 
with c(a) = fi(d;m)(a) > y/a. 

Proof. Suppose that fi(d;m)(a) > y/a for some a > 7. Let I be the maximal open 
interval containing a on which fj,(d;m)(z) > y/z, and let ao G / be the unique element 
with £(ao) = l(m). (This exists by Lemma 2.1.3.) If 7 G /, then clearly ao = 7 so that 
(d;m) belongs to the finite set £j. Otherwise, ao > 7. In particular, y(ao) > y(7) = 
8 - 3\/7 > ^. Moreover a < 9 by Corollary 1.2.4. 

Now write ao = p/q. There are only finitely many a = | G [7, 9] with q < 40, and for 
each of them Corollary 2.1.4 shows that there are only finitely many obstructive (d; m). 
We can therefore assume that q := q(ao) > 40 so that y(ao) — | > > 0. Since 
^(ao) = £(m) we can apply the last statement of Lemma 5.1.2 to conclude that 

d < 2(40) 2 V«o < 6(40) 2 . 

Since for each D there are only finitely many (d;m) G £ with d < D, this completes 
the proof. □ 

Remark 5.2.2. The result in Proposition 5.2.1 clearly extends to any interval of the 
form [a, b] provided that a > r 4 . O 

We already know that c(a) = | on [7, 7^] by Proposition 1.2.9. We can therefore 
assume that a G [7| , 8] . 

In order to explain our notation in Theorem 5.2.3 below, we work out the constraint 
given by the class 

(d;m) = (48;18 x7 ,3,2 x7 ) G £. 



Note that £(m) = 7 + 8 = 1(7 \). It gives the constraint fi(d; m)(7|) = ^ > \J 7| at 
7|. For a = 7 + x with x G [\, \\ we have w(a) = (l x7 ,x x8 , . . .). Therefore, 

m • w(a) = 7 ■ 18 + 3x + Ux = 126 + 17 x = 7 + 17a, 
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and so fj,(d;m)(a) = £(7 + 17a). Note that £(7 + 17a) = y/a at ui := 7.12499 

(where the last decimal is rounded). Similarly, for a = 7 + x with x € [|, we have 
u?(a) = (l x7 ,ir x7 , 1 - 7x, . . . ). Therefore, 

m • to (a) = 7 • 18 + 3x + I2x + 2 - Ux = 128 + x = 121 + a, 

and so /x(d;m)(a) = ^(121 + la). Note that ^(121 + a) = a/o at ui := 7.12501 (where 

the last decimal is rounded). The interval containing a = 7| on which this class gives 
a constraint is therefore Ii := [ui,vi], and 



fi(d;m)(z) = < 



^(7 + 17z) if ze [uiJl 



48* 

£(121 + z) ifze [7|,«i]. 



All this is expressed in the first row of the table below. In the same way we compute 
(A,B), (A',B'), u x , v x and fi(a) := fi(d;m)(a) at a = 7 + x for the other seven 
classes in the table below, where we write fi(z) = ^(^4 + Bz) for z just less than a and 
fi(z) = ^(A' + B'z) for z just greater than a. Note that the eight intervals [u x ,u x ] are 
all disjoint. 

Theorem 5.2.3. For a G [7^,8] we have c(a) = ^ except for the eight intervals 
[u x ,v x ] where c(a) is as described in the following table. 



(5.3) 



a 


(d; m) 


(A,B) 


(A',B') 


Ux 


V x 


fi(a) 


fj,(a) - s/a 


7 1 

' 8 


(48;18 x7 ,3,2 x7 ) 


(7,17) 


(121,1) 


7.12499 


7.12501 


1025 
384 


1.27 10" 6 


' 15 


(64;24 x7 ,3 x7 ,l x2 ) 


(14,22) 


(121,7) 


7.1333 


7.1334 


641 
240 


3.25 10" 6 


7i 


(24;9 x7 ,2,l x6 ) 


(7,8) 


(57,1) 


7.1428 


7.1429 


449 
168 


6.63 10" 6 


713- 


(40;15 x7 ,2 x6 ,l x2 ) 


(14, 13) 


(107,0) 


7.151 


7.156 


107 
40 


332.5 10" 6 


n 


(16;6 x7 ,l x5 ) 


(7,5) 


(43,0) 


7.1665 


7.22 


43 
16 


4218.4 10" 6 


7 1 

' 4 


(35;13 x7 ,4,3 x3 ) 


(0, 13) 


(87,1) 


7.2485 


7.252 


377 
140 


274.7 10" 6 


7± 

' 2 


(8;3 x7 ,l x2 ) 


(7,2) 


(22,0) 


7.328 


7.56 


11 
4 


11387.2 10" 6 


8 


(6;3,2 x7 ) 


(1,2) 


(17,0) 


7.97 


8.03 


17 

6 


4906.2 10" 6 



Remark 5.2.4. (i) The above table gives just enough decimal places of the (irrational) 
numbers u x , v x to describe their important features. For example ui = \ (9 + 4\/2) ~ 

7.328 < l\. 

(ii) In the above table there is one constraint centered at each point of the form 7^ 
for 2 < k < 8, except for k = 3 and k = 6. In fact, there are classes (d;m) giving 



«2 
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constraints centered at 7g and 7g, namely 

(96;36 x6 ,35,6 x6 ) at 7± and (24; 9 x6 , 8, 3 x3 ) at 7±. 

But these (d;m) have the property that (i(d;m)(a) = c(a) only at their center points. 
(See the proof of Theorem 5.2.3 at the end of this section for details). 

(iii) The four steps at the points 7g, 7j%, 7y, 7\ are the only ones in the graph of c(a) 
that are not flat to the right. O 

To prove Theorem 5.2.3 we will proceed as follows. Assume that (d; m) G £ is a 
class with 1(a) = £(m) and /j,(d; m)(a) > \fa for some a G [7^, 8]. We first assume that 
a = 7j: for some k G {1, . . . , 8}, and find all such classes (d; m). We then assume that 
a £ l^i^i! an d prove an upper bound D(zu) for <i if a = Zk := 7 2 k+i anc ^ an u PP er 
bound Dk if a ^ Zk- In both cases, we also show that mi = ■ ■ ■ = 7717. We then use 
a simple computer program to find all classes (d;m) as above at z^ with d < D(zi t )- 
Finally, we use another computer program to find all classes (d; m) as above at some 
a ^ Zj~ with d < Dk- 

We start by looking at the boundary points 7\ of our subintervals [7^j, 7^]. 

Lemma 5.2.5. The classes (d;m) G £ such that i(7\) = t(m) and n(d;m)(7^) > 
J7\ are 



(5.4) 



k 


(d; m) 


k 


(d; m) 


8 


(48 


18 x7 ,3,2 x7 ) 


8 


(384; 144 x6 ,143, 18 xS ) 


7 


(24 


9 x7 ,2,l x6 ) 


7 


(I68;63 x6 ,62,9 x7 ) 


6 


(96 


36 x6 ,35,6 x6 ) 


5 


(I6;6 x7 ,l x5 ) 


4 


(35 


13 x7 ,4,3 x3 ) 


3 


(24;9 x6 ,8,3 x3 ) 


2 


(8;3 x7 ,l x2 ) 


1 


(6;3,2 x7 ) 



Proof. We first look at the case a = 7\ = 8. Then i(m) = £(8) = 8. By Lemma 2.1.7 we 
need to consider 3 cases, namely m = (M xS ), m = (M + 1, M x7 ), m = (M x7 , M — 1). 
Consider the case m = (M x8 ). From the Diophantine equations 

3d = 8M + 1 
d 2 = 8M 2 - 1 

we obtain (8M + l) 2 = 9 (8M 2 - l), i.e. 4M 2 - 8M - 5 = 0. This equation has no 
solution in N. In the case m = (M + 1, M x7 ), the Diophantine equations give 

(8M + 1 + l) 2 = 9 (8M 2 + 2M + 1 - 1) 

whose only solution in N is M = 2, giving the solution (d;m) = (6;3,2 x7 ). In the 
case (M x7 ,M-l), the Diophantine equations give (8M-1+1) 2 = 9 (8M 2 - 2M + 1 - l), 
which has no solution in N. 
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Assume now that k £ {2, ... ,8}. In view of Lemma 2.1.7, there are five possibilities 
for m, namely 

(M x7 ,m xfc ), (M + l,M x6 ,m xfe ), (M x6 , M — 1, m xk ), 
(M x7 ,m+l,m x ( fe - 1 )), (M x7 ,m x ( fc - 1 ),m-l). 

Since £(m) = i{7^) = 7 + k, in the first four cases we can assume that m > 1 and in 
the last case we can assume that m — 1 > 1. We define sm and e m by 

M = -t= + e M , m = -^j=+e m . 
V 7 i k \l 7 l 

Case 1. m = (M x7 ,m x/c ). Then \M — km\ = \em — ke m \ < \em\ + k\e m \. Since 
|£m| 2 + k\e m \ 2 < 1, we find |em| + fc|£m| < a/AT+T, and so |M - fem| < \y/k + 1 - 1] G 
{0,1,2}. Set 



s = M — km £ 



{0,±1} if A; € {2,3}, 

{0,±1,±2} if jfe G {4,..., 8}. 



From the Diophantine equations 

J 3d = 7M + km + 1 
\ d 2 = 7M 2 + Arm 2 -1 

we obtain (7M + km + l) 2 = 9 (7M 2 + km 2 - l) . Since M = km + s, this becomes 
10 + km (16 — 9m + km) + 14s (1 — km — s) = 0. 

If s = 1, this is 

10 + km (2 - 9m + fern) = 0, 
which has solutions in N only if k = 5 or 2, namely m = 1, giving 
(I6;6 x7 ,l x5 ) at 7±, (8;3 x7 ,l x2 ) at 7±. 
No other allowed values for s and k yield integer solutions m. 

Case 2. m = (M + l,M x6 ,m xfc ). Then <r = k\e m \ 2 < }. Therefore, |M - km\ < 
l 

' {0} if fc = 2, 

{0,±1} if A;G {3,..., 8}. 
In this case, the Diophantine equations translate to 

(7M + /em + 2) 2 = 9 (7M 2 + 2M + 1 + km 2 - l) . 
With M = km + s this becomes 

(5.6) -4 - km (14 - 9m + km) + 2s (-5 + 7km + 7s) = 0. 



\sm\ + fe|e m | < + y f , and so 
(5.5) s := M -km G 
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If s = 0, this becomes 

—4 — km (14 — 9m + km) = 

which has no solution in N for k G {2, ... , 8}. For s = ±1 and k G {3, . . . , 8} equa- 
tion (5.6) has no solution in N. 

Case 3. m = (M x6 ,M — l,m xfc ). As in Case 2 we have (5.5). In this case, the 
Diophantine equations translate to 

(7M + km) 2 = 9 (7M 2 - 2M + 1 + km 2 - l) . 

With M = km + s this becomes 

(5.7) -km (18 - 9m + /cm) + 2s (-9 + 7km + 7s) = 0. 

If s = 0, this becomes 

18 - 9m + km = 

which has a solution in N for four k, namely k = 8, 7, 6, and 3. This gives the first four 
of the five entries in the table with m\ ^ m-j. 
If s = 1, equation (5.7) becomes 

—4 — km (4 — 9m + km) = 0, 
which has a solution in N only for k = 4. We get the solution (13;5 x6 ,4, l x4 ), which 
is, however, not obstructive, since it gives n{d;m)(j\) = f§ < \J~7\- 
If s = —1, equation (5.7) becomes 

32 - km (32 - 9m + km) = 0. 
It has a solution in N only for k = 2, and gives (19; 7 x6 , 6, 4 x2 ). But again this class is 
not obstructive, since fi(d;m)(7^) = f§ < \J~7\- 

Case 4. m = (M x7 , m + 1, m^" 1 )). Note that for e G R and A; G N with (fe- l)e 2 + 
(e + l) 2 < 1 we have e G [— 1,0] and hence 

|(fc-l)e + (e + l)| = |fce + l| < 1. 

Using this and a > we estimate 

\M - km - 1| = |M — (m + 1) — (fc — l)m\ = \e M - (e m + 1) - (k - l)e m | 

< |£m| + |(fe - l)£m +£m + 1| 



< V^ + 1 < 2 - 

Therefore, 

M-l = km + s with s G {0,±1}. 
In this case, the Diophantine equations translate to 

(7M + fcm + 1 + l) 2 = 9 (7M 2 + km 2 + 2m + 1 - l) . 
With M = km + 1 + s this becomes 

(5.8) -18 + 18m - km (18 - 9m + km) + 14s (km + s) = 0. 
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If s = 1, this is 

—4 + 18m — km (4 — 9m + km) = 0, 

which has a solution in N only when k = 8, m = 2 and k = 7, m = 1, giving us two 
more entries in our table. If s = 0, equation (5.8) becomes 

-18 + 18m - km (18 - 9m + km) = 0, 

which has a solution in N only for k = 4, m = 3. This gives the entry in the table at 
k = 4. If s = —1, equation (5.8) has no solution in N for k G {2, . . . , 8}. 

Case 5. m = (M x7 , m x( - k ~ 1 \ m — 1). As in Case 4 we find 

M + 1 = km + s with s G {0, ±1}. 

In this case, the Diophantine equations translate to 

(7M + km - 1 + l) 2 = 9 (7M 2 + /cm 2 - 2m + 1 - l) . 

With M = km — 1 + s this becomes 

(5.9) -14 + 18m + km (14 - 9m + km) + 14s (2 - km - s) = 0. 

If s = 1, this becomes 

18 - 9A;m + k 2 m = 0. 

It has a solution in N only for k = 6 and k = 3, namely m = 1. Since we assumed that 
m — 1 > 1, the corresponding classes (d;m) are not relevant. If s = or if s = — 1, 
equation (5.9) has no solution in N for k 6 {2, . . . , 8}. 

The above calculations show that the elements listed in Table 5.4 are the only ob- 
structive solutions to the Diophantine equations. One readily checks that these el- 
ements all reduce to (0;— 1) under standard Cremona moves, and therefore belong 
to £. □ 

Remark 5.2.6. Proceeding as in the proof of Lemma 5.2.5, one can find all classes 
(d;m) G £ with ^(d;m)(zk) > ^/zu and £(zk) = £(m) at the points := 7 2fc 2 ) _ 1 , 
k G {1, . . . , 8}, namely 

(64;24 x7 ,3 x7 ,l x2 ) at 7^ and (40; 15 x7 , 2 x6 , l x2 ) at 7^. 

For convenience, we will find these classes by a different method, that involves the first 
of the two computer programs of Appendix B. O 

We next derive upper bounds for d if a G ]7^-, 7^[. There are various ways to do 
this. We will give arguments that give rather low upper bounds, so that our method 
of finding c(a) depends as little as possible on computer computations (compare Re- 
mark 5.2.9 below). Note that a G ]7 ] ^ T ,7|[ has N + 1 blocks with N > 2, and that 
L := Yli>i &i — 2 + k with equality exactly if a = [7; k, 2] = 7 2 k+i = z k- 

Lemma 5.2.7. Suppose that (d;m) £ £ is such that (i(d;m)(a) = c(a) > y/a for some 
a with 1(a) = £(m). Suppose also that a has N + 1 blocks for some N > 2 and that 
a ^ ] 7 FFT) where 8 > k > 1. Then m\ = m-j. Further 
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(i) When L = 2 + k, the following table gives the maximum possible values D(z k ) 
of d for the different k. 



(5.10) 
(ii) 

(5.11) 



k 


8 


7 


6 


5 


4 


3 


2 


1 


D(z k ) 


104 


98 


92 


86 


79 


73 


69 


75 



When L > 2 + k, the following table gives the maximum possible values D k of d 
for the different k. 



k 


8 


7 


6 


5 


4 


3 


2 


1 




88 


81 


74 


67 


61 


56 


64 


66 



Remark 5.2.8. By Lemma 5.2.5 one cannot conclude m\ 
tion a ^ l\. 



m-j without the assump- 

O 



Proof. The proof of this lemma is based on an analysis of the equation (5.1) using the 
estimates for | ^ e «l obtained in Lemma 5.1.2. Recall from (iv) of that lemma that for 



a = 7- and with vm '■= -4= we have the estimates 

q qy/a 



(5.12) 

whenever 5 



that 



(i) The only number in ]7^j-, l\[ with L = k + 2 is z k := [7; k, 2] = 7^^. 



Note 



y(zk) 



+ 



2fc+l 



3a/7 + 



2fc+l 



> 



for all k. By (5.12) we therefore have 

/ a (k + 2) - l) JT k 



d < 



u{k + 2)-\) J7 + 



2k+l 



2k+l 



+ 



7 + 



2fc+i u y ' ^ 2fc+i 

With a < 1 this yields Table 5.10. If mi 7^ 771,7 we may take cr < ^. The largest value 
of d is then < 6 when k < 7 and < 9 when k = 8. But there are clearly no suitable 
(d; m) with such small d. Therefore this case does not occur. This proves mi = 7777 for 
L = 2 + k. 

We will prove (ii) and the claim that mi = 7777 together. We will give separate 
arguments for the three cases k = 1, k = 2 and k G {3, . . . , 9}. Denote a k = 7jrjr[ for 
some 1 < k < 8. We have the table (rounded down to 3 decimal places) 



k = 


8 


7 


6 


5 


4 


3 


2 


1 


y{a k ) > 


J = 0.111 


0.117 


0.125 


0.135 


0.150 


0.172 


0.209 


0.284 



The case k = 1. Assume that (d;m) G £ is a class with n(d;m)(a) > yj~a and 
£(a) = £(m) for some a G ]7|,8[ other than 7|. We first prove that mi = 7777. If not, 
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then a < j. Therefore, vm = > 1 — ^t=j > 0-73, since otherwise a > e 2 M + e 2 M _ 1 > 
2± = f Also, q>L>3 + k = 4, and so y(a) - \ > y(l\) -\> 0.28 - \ > 0. We 
can therefore apply (5.12): 



f >^>i + £(v(«)-j) >i, 

showing that g > 8. Therefore, y(a) - ^ > y(7|) - 1 > 0.28- | > |. Using again (5.12) 
we finally find 

5^ < 8 • 0.73^ < 0.73^ < d < ^ ( - 1 | < 7^ ( — |— - l) < 3^/a, 

a contradiction. 

We now prove that d < 66. For a as above, both numbers 

f(a,q) := ^ 

a + 1 - 3Va- i 



a + 1 - 3 Va - ± y a + 1 - - ± J 

are positive. Moreover, by (5.12) we have d < f(a,q), and using also Lemma 5.1.2 (iv) 
we see that d < g(a, q). We saw above that q > 4. We first use the function / to see 
that for q G {4, 5, 6, 7, 8} we have d < 26. Assume now that q > 9. We then view a 
and q as independent variables of the functions / and g. Both f(a, q) and g(a, q) are 
decreasing functions of a. With a\ = 7\ we therefore have 

d < maxmin{/(ai,q>), g(ai,q)} . 

q>9 

One readily checks that f(ai, q) is increasing on {q > 9} and that g(a\,q) is decreasing 
in q. Since d < f(a±, 56) < 67 if q < 56 and d < g(a\, 57) < 67 if q > 57 we conclude 
that d < 66, as claimed. O 

Remark 5.2.9. This method for estimating d can be used for all k < 8. However, 
the estimates get worse, e.g. for k = 8 (with the factor y/q — 1 of / replaced by 



+ I — 1, see (5.13) below) one finds d < 410. One could also omit checking that 

obstructive classes have mi = mj, and use a variant of our computer code SolLess 
from Appendix B.l that does not use m\ = 777.7. O 

The case k = 2. The class (8; 3 x7 , l x2 ) gives the constraint c(a) > /io( a ) = ^-g^ > v 7 "- 
on [7|,7^]. Assume that (d;m) G £ is a class with n(d;m)(a) = c(a) > for some 
a G [7|,7|]. Proposition 2.1.1 (i) implies that 

7 + 2a 



< n(d;m)(a) < ^y/l + T/S. 
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When a = 7\ this gives the estimate d < 64. Since decreases on [7|, 7^], we find 
d < 64 everywhere. We will check m\ = 7717 for k = 2 and L>3 + fc = 5at the same 
time as for k > 3. O 

The case G {3,..., 9}. Suppose that a G ]7^r,7^[ for some k > 2, and that 
L > 3 + k. Then we may write 

a= [7;M 2 ,...,4v] = 7+^-r 

where ^ := a' := [0;£ 2 , ■ ■ ■ ,£n]- Thus ^ = gjv-i(a') > Ej>2^j = : L ' h Y Sub " 
lemma 5.1.1, and so 

L := = k + L' < k + q. 

Since q = kq' + j/ we find L < k + %. Moreover, q' > V = L — k > 3, and so 
q > 3k + 1. Therefore, for a G ]«fc,afc-i[ we have y(a) > y(afc) > ^-j- > |. Thus the 
inequality (5.12) implies that 

(5-13) (1 " jjs) + >(a) = 1 + £ (y(a) - j) < + 

Case 1: I < v M := ^ < f . 

Because y(a) > y(ak) for all a G [afc)Ofe-i] an d y(as) = |, we must have 

4 < i + < i + >(«*) < (1-575) + ^ < , 

where c' < | is as in Lemma 5.1.2 (hi). 

Note that the squared error of the last two is at least 2(^) 2 = |. Therefore, if 
also mi 7^ 771,7, we have a' < ^ — | = ^. But, for each G [2, 8], the inequality 

holds only if g 2 < 6(/c + |). Since this quadratic inequality holds for g = and does 
not hold when q = 3k + 1, it does not hold for any g > 3k + 1. Therefore, for each k 
we have mi = 771,7, and cr' < |. 

Now suppose that k = 8, and consider the inequality 

| + fy(«8) = | + T8-< v f(8 + §). 

This holds when q = but does not hold for q > 63. Thus g < 62 so that ^| < 
V I + t) ~~ 3" Since a < 7| we get d < 83. The same argument works for the 
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other k, and we obtain the following upper bounds for q and then for d. 



k = 


8 


7 


6 


5 


4 


3 


q< 


62 


58 


53 


49 


45 


41 


d< 


83 


77 


71 


66 


61 


56 



Case 2: vm < \- 

Since the squared error i^b 2 N on the last block is now at least ^, we must have 
mi = 7717 an( i o* < \- Further, by Lemma 5.1.2 (hi), > |. Therefore (5.13) gives 

h + ly{a k ) < \ + f a y{a k ) < sj\{k + l). 

This gives the following upper bounds for q and d. 



k = 


8 


7 


6 


5 


4 


3 


q< 


62 


57 


53 


49 


45 


42 


d< 


55 


51 


47 


43 


40 


37 



Case 3: | < v M < 1- 
Now (5.13) gives 

\qy(a k ) < ^y(a k ) < \J a(k + |) . 

If o < this is not satisfied when q > 3/c + 1 for any fe G {2, . . . , 8}. Thus ttt-i = my. 
Further, taking a = 1 we obtain the following upper bounds for q and c?. 



k = 


8 


7 


6 


5 


4 


3 


q< 


44 


40 


37 


33 


30 


26 


d< 


88 


81 


74 


67 


60 


53 



Case 4: 1 < vm- 

In this case, (5.13) gives 

qy(a k ) < l + q(y(a k )-±) < 1 + f a {y(a k ) - \) < l + ^y(a)-\) < ^ a{k + f) . 

We have already seen in Case 3 that qy(a k ) < o{k + |) is impossible for a < j. 
Thus mi = m 7- 

Further, taking a = 1 we obtain the following upper bounds for q and d. 



k = 


8 


7 


6 


5 


4 


3 


q< 


31 


28 


25 


22 


19 


17 


d< 


82 


75 


68 


61 


54 


46 
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Taking for each k the worst upper bound for d in the different cases, we obtain Ta- 
ble 5.11. This completes the proof of Lemma 5.2.7. □ 

Corollary 5.2.10. (i) The only classes (d;m) G £ such that £(zk) = £(m) and 
such that fi(d;m)(zk) > yfzT are 

7^: (64;24 x7 ,3 x7 ,l x2 ) and 7^: (40; 15 x7 , 2 x6 , l x2 ) 

(ii) There are no classes (d;m) G £ such that £{a) = £(m) and fi(d;m)(a) > \fa 
for some a G ]7|, 8[ not of the form 7^ or 7 2 k+i ■ 

Proof, (i) The computer code SolLess [a,D] given in Appendix B.l finds for a rational 
number a and a natural number D all classes (d; m) G £ with £{m) = £{a) and 
n(d;m)(a) > \fa and d < D. For k G {1, ... ,8} we choose D = D{zk) as given by- 
Table (5.10). The code SolLess [a, D] with D = D(zk) and a = Zk tells us that for 
k = 7 and k = 6, the only such classes are the ones given in the corollary, while for the 
other k there are no such classes. Finally, one checks that the two classes in (i) reduce 
to (0; —1) under standard Cremona moves, and hence belong to £. 

(ii) The computer code InterSolLess [k,D] given in Appendix B.2 provides for a 
natural number D a finite list of candidate classes (d; m) G £ with £(m) = £(a) and 
fi(d;m)(a) > \fa and d < D for some a G ]7^j,7^[. For k G {1, ... ,8} we choose 
D = Dk as given by Table (5.11). The code InterSolLess [k,D] with D = Dk tells 
us that for k ^ 4 there are no candidate classes, while for k = 4 the only candidate 
class is (d;m) = (59; (22 7 , 5 3 , 4, l 3 )). Since the length of the second block is 4 and 
the length of last block is > 2, the a in question must be [7; 4, 3] or [7; 4, 1,2]. The 
second possibility is excluded by Lemma 2.1.8 (i) applied to the third block. Moreover, 
at a = [7; 4, 3] = 7^ we have n(d;m)(a) = < y/a, which excludes also the first 
possibility. □ 



Proof of Theorem 5.2.3. Recall from Proposition 1.2.9 that c(7|) = y 7|. More- 
over, by Lemma 2.1.3 any class (d;m) G £ with /x(d;m)(8) > y/8 must lie in £g. By 
looking at the list of elements in £g given in Lemma 1.2.7 one checks that the only 
such class is (6;3,2 x7 ). By using Lemma 2.1.3 once more, we conclude that all con- 
straints on [7^,8] come from the ten classes of Lemma 5.2.5 and the two classes from 
Corollary 5.2.10. 

In the paragraph just before Theorem 5.2.3 we worked out the constraint /j,(d; m) 
given by the class centered at 7|. Similar computations show that all the eight classes 
in Table 5.3 behave as described there. In order to prove Theorem 5.2.3, it therefore 
remains to check that the four classes from Lemma 5.2.5 that do not appear in The- 
orem 5.2.3 give no further constraints. However, one can calculate the corresponding 
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functions [i(d; m) just as before, obtaining the following data. 6 
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(5.14) 



a 


(rf; m) 


(A,B) 


(A',B>) 


H(a) 


N(A,B) 


s 


' 8 


(384; 144 x6 ,143, 18 x8 ) 


(-1,144) 


(1025,0) 


1025 
384 


74322 


18 


' 7 


(168;63 x6 ,62,9 x7 ) 


(-1,63) 


(449, 0) 


449 
168 


14373 


9 


' 6 


(96;36 x6 ,35,6 x6 ) 


(-1,36) 


(257,0) 


257 
96 


4758 


6 


' 3 


(24;9 x6 ,8,3 x3 ) 


(-1,9) 


(65,0) 


65 
24 


327 


3 



In all cases the new constraint takes the same value at its center point as the old one 
but the slope to the left is steeper (because A = — 1) and it is flat (i.e. with B' = 0) 
rather than increasing to the right. This completes the proof. □ 

5.3. The interval [8,9]. In this section we compute c(a) on the interval [8,9]. We 
first prove that c(a) = yfa for a > 8^. 

Lemma 5.3.1. Suppose that fi(d;m) (a) > y/a for some a G [8^,9) with £(a) =£(m). 
Then d < 16 and mi = • • • = m$. 

Proof Note that y(a)>y(8± 



36 • 



19 

36 



> | for all q > 2. Assume first that q > 12. Then 
5 := y(a) — ^ > |§ — = f • Suppose that mi / rag. Then a < |, and hence % > §• 
This and (5 > g shows that < 1, which is impossible by Lemma 5.1.2 (iv). In order 
to prove that d < 16, note that 

ifvAf then £<^§ = §; 



14. 

9 ' 



if UM >|,then^<|. 



Lemma 5.1.2 (iv) with \fa < 3 therefore shows that 

j < 3 (m i _ i") - 27 5 
" — 4/9 \ 9 4/9 1 J 

and hence cZ < 16. 



4 2 



< 17 



Assume now that q < 11. Note that a < 8^ and S 



Lemma (5.1.2) (iv) therefore shows that 

d < 



V(a) - \ > y(8|) - \. 



9 



y(8 



r(V9-i) 



9' <? 

The RHS is < 17 for all q G {2, . . . , 11}, and so d < 16. Suppose that mi 7^ mg. Then 
a < |. If g < 8, then ^faq— 1 < 0, contradicting (iv) of Lemma 5.1.2. If q G {9, 10, 11}, 
then 



d < _1§_ ^ 2 

9^-9^ 3' 



We also calculated the number N(A, B) of integer points in the triangle T% B and the number s of 
integer points on its slant edge because of their relevance to Remark 2.4.5. 
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and hence e ■ e > | + 2 ■ g > 1, a contradiction. □ 
Proposition 5.3.2. c(a) = y/a for a G [8^,9). 

Proof. Suppose to the contrary that p,(d; m) (a) > v^forsomea > 8^. By Lemma 2.1.3 
we may choose ao with £(cio) = £(m) in the interval / containing a on which this in- 
equality holds. 

We first claim that ao > 8. By Lemma 2.1.3 it suffices to see that £(m) > 8. One can 
prove this by explicit calculation since is finite. In fact, the last obstruction given 
by the elements of £% is that centered on a = 8 which is discussed in Remark 1.2.10. 
As we saw there, this is not effective when a > 8^. 

It then follows that ao > 8^. For if not, because / contains a > 8^, it must also 
contain 8^. But clearly £(z) > £{8±) = 8 + 36 = 44 for z G (8,8^). Therefore the 
minimum of £{z) on / cannot occur in this interval. 

We may therefore apply Lemma 5.3.1 to ao- Hence d < 16 and m := m\ = rag. 
Since rrii = 3d — 1 < 47, we must have m < 5. It remains to check that there are no 
solutions to the Diophantine equations (1.8) for any choice of m < 5. 

Suppose first that m = 5. We then look for solutions of 

(5.15) 3d-l = 40 + ^mi, d 2 + 1 = 200 + ^ mf. 

i>8 i>8 

The second equation shows that d G {15, 16}. For d = 15, (5.15) becomes 4 = ^2 i>8 rrii, 
26 = £ 

i>8 m ?' which has no solution. For d — 26, (5.15) becomes 7 — Yli>8 m i> 
57 = J2i>8 m h which has no solution either. For m < 5 there are no solution either. □ 

Corollary 5.3.3. c(a) = ^ for a G [8,8^]. 

Proof. The class (d; m) = (6;3,2 x7 ) gives c(a) > fi(d;m)(a) = ^? = ^/ 8^ for a > 8. 
Therefore c must be constant on this interval because it cannot decrease. □ 

Appendix A. Weight expansions and Farey diagrams 

In this section, we show that the weight expansion w(a) described above agrees 
with the expansion considered in [15]. For clarity, we call the latter the Farey weight 
expansion; see Definition A. 7. It arose from a procedure of constructing an outer ap- 
proximation to an ellipsoid by repeated blowing up. After explaining this, we establish 
the equivalence of the two definitions in Corollary A. 10. No doubt, versions of this 
result are already known. However, since it is not hard, we give a direct proof in our 
context. 

Definition A. 4. Let (pi = Pi/qi), i = 0, . . . , N, be a sequence of rational numbers in 
lowest terms, with po = 0/1, pi = 1/1 and pi > for i > 0. We say that the two 
elements Pj, Pk are adjacent in (pi) if they are neighbors when the numbers po, . . . , pn 
are arranged in increasing order. Further (pj) is called a Farey expansion of the 
rational number a if the following conditions hold: 
(i) p N = a; 
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(ii) qi < q i+ i for all i>l; 

(iii) adjacent pairs p/q,p' /q' of elements of (pi) have the property that 
(A.l) \pq' -p'q\=l; 

(iv) condition (iii) does not hold if any term is removed from this expansion. 
Example A. 5. The Farey expansion of | is 1, \, §, |, | which may be arranged as 

I<4 < 3 21 
2^7^5^3^1' 

Lemma A. 6. (i) Every positive rational number a has a Farey expansion. 

(ii) This expansion is unique. Moreover, if pi, ■ ■ ■ , pn is the Farey expansion of a = pjy, 
then for all n < N, p\, . . . , p n is the Farey expansion of p n . 

Sketch of proof. Given positive fractions pi := ^ and pj := we define their Farey 



sum to be 



Pj 



Pi + Pj 



Ii + Qj 

If < a < 1, the expansion is constructed inductively starting with po = and p\ = 1, 
in such a way that Pi+\ := p% ® Pj where j is the largest number < i such that a lies 
between pi and pj. Thus p2 = \, and p% is either | (if a < p^) or | (if a > p^). The 
construction stops when a = pjy- 

If a lies between k and k + 1, then the expansion begins with the terms pi := y, 
i = l,...,fc + l. Then pt+2 = Pk® Pk+i = and the expansion proceeds as in the 
previous case. Further details may be found in Hardy and Wright, [8, Ch. III]. □ 



(1,0) =vq 




a(0,1)=«-i 



e-i 



Figure A.l. The Farey diagram for p/q = 5/3. Here v\ = (1, 1), vi = 
(1,2), V3 = (2,3), t>4 = (3,5). The edge 62 meets e\ and eo, while 63 
meets e\ and €2, and £4 meets 62 and £3. 

One can build a diagram in R 2 corresponding to a given Farey expansion by as- 
sociating to each fraction Pi/qi a line segment (called an edge) with normal vector 
V{ := (qi,Pi) of slope Pi/qi. See Figure A.l. One starts with the first quadrant whose 
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edges e_i,eo are the positive coordinate axes with (inward) normals (q-±,p-i) = (1,0) 
and (qo,po) = (0, 1), and builds up a sequence of edges by cutting along certain direc- 
tions. The first cut is along an edge e\ going from e_i to eo with normal v\ = (1,1). 
In general, if pi is the Farey sum of pj with for some j < i — 1 then the ith cut 
is along an edge q with normal Vi that meets the edges ej and ej_i (but none of the 
others). The collection of edges ei, . . . , ejv is called the Farey diagram; the extended 
Farey diagram also includes the edges e_i,eo- 

As described in [15, §3], adding a new edge whose normal is the sum of the two 
adjacent normals corresponds to a (smooth) blow up, since in the toric model, each 
blow up corresponds to cutting off a corner of the moment polytope. Therefore we 
can think of the process of constructing the Farey expansion for a as the process 
of blowing up the first quadrant repeatedly and in as efficient a way as possible, in 
order to obtain a (smooth) polytope with one edge whose normal has slope a. In the 
language of [15], this is an outer approximation; see Figure 3.1 and Lemma 3.8 ff. in [15]. 
For further discussion of the relation between weight sequences and the resolution 
of singularities by blow up, see the end of [16]. This contains a description of the 
Riemenschneider staircase that links the weight expansion for a to the Hirzebruch- 
Jung continued fraction expansions for the two singular points at the vertices of the 
toric model of the ellipsoid E(l, a). 

Given such a sequence of edges ei, . . . , ejv one can define an associated sequence of 
Farey labels Ai, . . . , Ajv as follows, starting with the last ejv that is labeled by Xn ■= 1. 

(*) If 6j,j > n, is labeled by Xj, label e n with the sum of the labels of the edges 
€j,j > n, that intersect e n . 

Definition A. 7. If a = p/q has Farey diagram with labels Aj, 1 < i < N, the Farey 
weights of a are the numbers U{ := Aj/Ai, for i = 1, . . . , N. 

Note that reflection in the line p = q converts the Farey diagram for p/q into that 
for q/p. Therefore the Farey weights for p/q and q/p are equal. 

These Farey weights are the weights considered in [15]. Our aim in this section is to 
show that these agree with the weights w(a) of Definition 1.2.5. 

Example A. 8. One can see from Figure A.l that when p/q = 5/3 the labels Aj (in 
decreasing order) are A4 = 1, A3 = 1, A2 = 2, Ai = 3 which gives the Farey weights 
1, |, |, |. These agree with the weight expansion constructed in Definition 1.2.5. 

In the following we will denote the distinct Farey labels for a = p/q by hi > hi > 
■ ■ ■ > hs > and will suppose that they occur with multiplicities n±, . . . , n$. Thus we 
write 

(A!,A 2 ,...,A W ) = {h* n \...,h x s ns ). 

Proposition A. 9. Let h± > ■ ■ ■ > hs > be the distinct Farey labels for a = p/q and 
suppose that they occur with multiplicities n±, . . . ,n$. 

(i) If a £ (k, k + 1] then n\ = k and hi = q, hi = p — kq; 

(ii) If a £ \\/(k + 1), 1/k) then n\ = k and h\ = p, hi = q — kp; 
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(iii) In both cases the hi for 1 < i < S satisfy the recursion relation 

hi = n i+ ih i+ i + /ij + 2, 

where h$+i := 0. 

Corollary A. 10. For all a > 1 the weights w(a) of Definition 1.2.5 are the Farey 
weights of a. 

Proof. If we write w(a) as w(a) = ^l^,a;^ 2 , • • • ,x^f K ^j then Definition 1.2.5 implies 

that the Xi are characterized by the properties that x\ = 1, Xi > Xj+i > and the 
recursive relation 

Xi = £ i+ ix i+ i + x i+2 . 

Since the Aj are positive and nonincreasing, Proposition A. 9 shows that Farey weights 
Aj/Ai have precisely the same characterization. □ 

Proof of Proposition A. 9. Reflection in the line p = q converts the Farey diagram 
for p/q into that for q/p. Therefore statements (i) and (ii) are equivalent. We will 
prove all three statements together by an inductive argument. 

We use the extended diagram obtained by adding to the edges ei, . . . ,ejv the edge 
eo with normal vq = (0, 1) and the edge e_i with normal v-\ = (1,0). When a > 1 we 
order them as e_i, eo, ei, • • • , and then label these as in (*) above; when a < 1 we order 
them as eo, e_i, e±, . . . , and then label them using (*). 

li k < a < k + 1, the Farey expansion starts with l,2,...,fe,fc + l and then contains 
further elements between k and k + 1. It follows that n\ = k. Further, because the 
only edges meeting eo are e±, . . . , e^+i, we have 

(A. 2) A = k\i + A fe+ i = n\h\ + h 2 . 

Similarly, because the only edges meeting e_i are eo and e±, we have A_i = Ao + Ai. 
Therefore (i) is equivalent to 

(iv) n\ = k and Ao = p, A_i = p + q when p/q > 1. 
Similarly, if a € (l/(fc + 1), 1/k) we find that 

n\ = k, A_i = k\\ + \k+i = n\h\ + h 2 , A = A_i + Ai. 

Hence (ii) is equivalent to 

(v) n\ = k and A_i = q, Ao = p + q when p/q < 1. 

We argue by induction on N, the length of the Farey expansion of a. By symmetry, 
it suffices to consider the case when a G (k, k + 1]. The result is clear when a = k + 1 
(and also for the trivial case a = 1 which has a single label Ai = 1). Point (i) is easily 
checked when N = k + 2 since then p/q = (2/c+l)/2. Similarly, one can check it for the 
two numbers (3k + l)/3, (3k + 2)/3 with N = k + 3. (Note that v k+2 = (2, 2k + 1) and 
v k+3 is either (3, 3k + 1) = (1, k) (2, 2k + 1) or (3, 3k + 2) = (2, 2/e + 1) (1, fc + 1).) 
Now, consider the matrix 
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that takes the vectors (1, k), (1, k + 1) to (1, 0) = v-±, (0, 1) = vq. Then 

Therefore \ip/q = : fc + x, we find j//V = x/(l — x). In particular, p'/V > 1 if and only 
if x > 1 — x, that is, exactly if rii = 1. 

Because detA k = 1, A k preserves the Farey addition relation between adjacent 
normals. Hence if v\ = (1, l),t>2, • • • , vn are the normals in the diagram for p/q, the 
normals for the diagram for p' /q' are 

AkVk+2 = (1, 1), A k V k +3, ■■■ , A k VN- 

In fact one could construct the diagrams for p/q and p'/q' so that there is an affine 
transformation obtained by following A k by a suitable translation that takes the stan- 
dard diagram for p/q to the extended diagram for p'/q'. Therefore, if p'/q' > 1, the 
labels X k , X k+ i, . . . , Xn for p/q equal the labels X'_ 1 , X' , X[, . . . , X' N _ k _ 1 of the extended 
diagram for p' /q' . Hence: 

if p'/q' > 1, then n>2 = 1 and the multiplicities for p'/q' are ns,...,ns with 
corresponding labels h% , . . . , hs ■ 

Therefore because the recursive relation (iii) holds for p'/q' it holds for p/q and i > 3. 
Further, by equation (A. 2)) and (iv) applied to p' /q' , Ag = 713/13 + /14 = p' and A'_ : = 
A + X[ = p' + /13. Therefore, since ri2 = 1, 

h2 = Xk+i = A = n 3 /i 3 + /14 = p, hi = Ai = A'_i = n 2 /i2 + /?-3- 

This shows that (iii) holds for p/q. Moreover, /12 = p' = p — kq and, by (i) for p' /q' , we 
find h\ = p' + /ij = p' + q' = q. 

This completes the proof when p'/f/' > 1. When p'/q' < 1, the proof is similar. By 
(v), the labels X' , X'_ 1 , X\, . . . (note the reordering) for the extended diagram for p'/q' 
are A^ + A^+i, A&, X k+ 2, ■ ■ ■ , with first multiplicity n[ = ri2 — 1. Further details will be 
left to the reader. □ 



Appendix B. Computer programs 

B.l. Computing c(a) at a point a. In this section we describe a Mathematica pro- 
gram SolLess [a,D] that finds for a rational number a and a natural number D all 
classes (d; m) G £ with £(m) = £(a) and /j(ci; m)(a) > ^/a and d < D. We have applied 
this program in the proof of Theorem 5.2.3 to eight numbers z k = 7 2k +i in \j\-> The 
present program can be used for all a. By removing one line, one obtains a program 
finding all obstructive solutions at a with d < D (not just those with £(m) = 1(a)). 

Recall from Remark 5.2.6 that instead of using the code SolLess, one can use the 
algebraic method from the proof of Lemma 5.2.5 to find all obstructive classes (d; m) 
at z k with i(m) = i(z k ). We have chosen to use this code for convenience, and because 
it might be helpful for understanding the more involved code of § B.2. 
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We start with computing the weight expansion w(a) of a rational number a. For 
convenience, we use that the multiplicities of to (a) are given by the continued fraction 
expansion of a. 

W[a_] := Module [{aa=a,M,i=2,L,u,v>, 

M = ContinuedFraction [aa] ; 
L = Tabled, {j ,M[ [1]] }] ; 
{u,v> = {1 , aa-Floor [aa] } ; 
While [i <= Length [M], 

L = Join[L, Tablet v, {j,M[[i]]>] ]; 

{u,v> = {v,u - M[[i]] v>; 

i++] ; 
Return [L] ] 

For instance, W [3+2/3] yields {1,1,1,2/3,1/3, 1/3}. 

We next give for each natural number k a list of 4 vectors, from which we will construct 
candidates for the vectors m. 

P[k_] := Module [{kk=k,PP, TO, i>, 

TO = Table [0,{u,l,k>] ; 
TOp = ReplacePart[TO,l,l] ; 
Tl = Table [l,{u,l,k>] ; 
Tim = ReplacePart [Tl,0,-1] ; 
PP = {TO, TOp, Tl, Tim}; 
Return [PP] ] 

For instance, P [3] yields {0, 0, 0}, {l, 0, 0}, {l, 1, l}, {l, 1, 0} . 

Our next task is to construct for given a all candidate vectors m. To this end we 
first take a given multiplicity vector M, say (k±, k2, k^), and associate to it all vectors of 
length k\ + k2 + ks such that that the j th block is a vector from P Ikjl . In the example 
we thereby obtain 4 3 vectors . We use the sets P [k] and a recursion: 

Difference [Mj : = Module [{V=M , vN , VI , 1 , L={} , D , PP , i , j , N> , 

1 = Length [V] ; 

If [ 1 == 1 , L = P [ V[[l]] ]] ; 
If[ 1 > 1, 

vN = V[[-l]] ; 

VI = Delete [V,-l] ; 

D = Difference [VI] ; 

PP = P [vN] ; 

i = 1; 

While [ i <= Length [D], 

j=i; 

While [j <= Length [PP], 
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N = Joint D[[i]] , PP[[j]] ] ; 
L = Append [L,N] ; 

i++] 

]; 

Return [L] ] 

We now take a positive rational number a and d G N and compute all solutions of the 
Diophantine equation with d given that are obstructive at a: We first take the multi- 
plicity vector M of to (a), and then round down each of its entries, getting F. In view 
of Lemma 2.1.7, an obstructive multiplicity vector m must be of the form F+D [ [i] ] , 
where D[[i]] is the ith vector from the list Dif f erence [W [a] ] . We therefore run 
through this list, and each time check whether V=F+D [ [i] ] is a solution of the Dio- 
phantine system, has last entry positive, and is obstructive: fj,(d;V)(a) > \[a. If all 
three conditions are fulfilled, we add V to our list, and also retain d. 

Sol[a_,d_] := Module [{aa=a,dd=d,M,F,D, i ,V,L={}> , 
M = ContinuedFraction [aa] ; 
F = Floor [ dd/Sqrt[aa] W[aa] ]; 
D = Dif f erence [M] ; 

i=i; 

While [i <= Length [D] , 

V = Sort [F+D [[i]] , Greater]; 

SV = Sum[ V[[j]], { 3,1, Length [V] } ]; 

If [ {SV, V.V> == {3dd-l, dd~2+l} 

kk V[[-l]] > 

kk W [aa] . V / dd >= Sqrt [aa] , 

L = Append [L, V] 

] ; 
i++] ; 

Return [{dd, Union [L]>] ] 

For instance, Sol [7 + 1/8, 48] yields 

{48, {{18, 18, 18, 18, 18, 18, 18, 3, 2, 2, 2, 2, 2, 2, 2}}}. 

Remark B.l. (i) We were not at all economical when constructing the list Difference [M] : 
In view of Lemma 2.1.7, for an obstructive vector F+D[[i]] there is at most one kj 
such that the vector P Ikjl appearing in D [ [i] ] can have both and 1 as entries. We 
have chosen this form of the program to make it more readable. 

(ii) In the main body of the paper, we applied this program only to the eight numbers 
Zk = ^ ' 2k+i ' an d f° r these numbers we know that mi = mj by Lemma 5.2.7. We did 
not use this information so as to make the program applicable also at other points, 
e.g. to l\ in order to check Lemma 5.2.5 (at least for all d < 2000 or so). 
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(iii) Recall from Lemma 2.1.3 that for every (d;m) that gives an obstruction at a 
we have £(a) > £(m). The condition V[[-l]] > asked in Sol[a,d] is therefore 
equivalent to £{a) = £(m). By removing this condition, we obtain a program finding 
all obstructive solutions (d; m) at a. O 

We finally collect, for given a and D 6 N, all solutions that are obstructive at a and 
have d < D: 

SolLess[a_,D_] := Module [{aa=a,DD=D,d=l ,Ld,L={}}, 

While [d <= D, 

Ld = Sol [aa,d] ; 
If [ Length [ Ld[[2]] ] > 0, 
L = Append [L , Sol [aa,d] ] 

]; 
d++] ; 
Return [L] ] 

B.2. Computing c(a) on an interval. In this section we describe a Mathematica 
program InterSolLess [k,D] that provides for a natural number D a finite list of 
candidate classes (d;m) G £ with £{m) = £(a) and fi(d;m)(a) > ^fa and d < D for 
some a G ]7^j, 7|[, a ^ z^, where Zk = [7; k, 2]. We have applied this program in the 
proof of Theorem 5.2.3 to the eight intervals ]7^xj,7^[, k G {1, . . . ,8}. Throughout 
we assume that a, b and the rrii are positive integers. 



(B.l) 



Our first goal is to list for a given pair a, b all solutions of the Diophantine system 



To illustrate our method, let us find in an algorithmic way the solutions m of (B.l) 
for (a, b) = (4, 6). It suffices to list solutions m = (mi,rri2, ■ ■ ■ , vhm) with m\ > mi > 
■ • • > rriM- We must have mi < |_\/6j = 2. We therefore try with m\ = 1 and 
m\ = 2. For a solution m, the next numbers (7712,7713,...) must fulfill (B.l) with 
(o, b) = (4 - 1, 6 - 1) = (3, 5) and (a, b) = (4 - 2, 6 - 4) = (2, 2). In the first case (when 
mi = 1); we only need to try with 777,2 = 1- The next numbers (777.3, • • • ) of a solution 
must then fulfill (B.l) with (a, b) = (3- 1, 5- 1) = (2, 4). Then a 2 = b, so that the only 
solution is 777,3 = 2. But 777.3 = 2 > 1 = 7772, whence we discard the solution (1, 1,2). 
In the second case (when mi = 2), we try to find numbers (777,2,777,3, . . . ) solving (B.l) 
with (a, b) = (4 — 2, 6 — 4) = (2, 2). We only need to try with 777,2 = 1, and then want to 
solve (B.l) with (a, b) = (2 — 1, 2 — 1) = (1, 1) for 7773. Since a? = b, the only solution 
is 777,3 = 1- We therefore find the solution m = (2, 1, 1). 

The code Solutions [a, b] below does the same thing by a recursion. Note that if 
a? < b, then (B.l) has no solution. 

Solutions [a_,b_] := Solutions [a, b,Min [a, Floor [Sqrt [b] ]] ] 
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Solutions [a_,b_,c_] := Module [{A=a,B=b,C=c , i ,m,K,j ,V,L={}} , 

If [ A~2 < B, L={>] ; 
If [ A~2== B, 

If [ A > C, L=0, L={{A}} ] ] ; 
If [ A~2 > B, 

i=i; 

m = Min [Floor [Sqrt [B] ] ,C] ; 
While [i <= m, 

K = Solutions[A-i,B-i~2,i] ; 

j=i; 

While [j <= Length [K] , 

V = PrependE K[[j]] , i] ; 
L = Append [L,V] ; 

]; 

i++] 

]; 

Return [Union [L] ] ] 

Notice that applied to (a, b) = (3d — 1, d 2 + 1), the above algorithm lists all solutions of 
our principal Diophantine equation. For large d, however, there are many solutions. We 
shall therefore directly choose the first 7 + k + 1 numbers nii, using that for obstructive 
solutions the vectors m and w(a) must be essentially parallel, and shall then use the 
code Solutions only to choose the remaining 

It will be useful to have a short expression for the sum of the entries of a vector L: 

sum[L_] := Sum[ L[[j]], {j , 1 .Length [L] > ] 

For given k > 1 the following code gives three vectors of length 7 + k that have all 
entries equal to except that the last entry of the first vector is —1 and the eighth 
entry of the third vector is 1. 

P[k_] := Module [{kk=k,PP, TO, i>, 

TO = Table [0 , {i , 7+kk}] ; 
Tm = ReplacePart [TO ,-1,-1] ; 
Tp = ReplacePart [TO , 1 , 8] ; 
PP = {Tm,T0,Tp}; 
Return [PP] ] 

For k = 4, this gives {0 7 , 0, 0, 0, -l}, {0 7 , 0, 0, 0, 0}, {0 7 , 1, 0, 0, 0}. We shall use these 
vectors to take into account that the rrii may not be constant on the second block. 

Fix d G N and k G {1, . . . , 8}. Assume that m is such that (d; m) G £ and such that 
l(m) = 1(a) and /u(d;m)(a) > a/o for some a G ]7^]-,7^[. 
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In view of Lemma 2.1.7 we have mi = mj 
a<7±, 

mi = 4=+£i > 



. Moreover, since |ei| < < \ and 



and hence m\ > ml : = Round 



. In the same way we see that m\ < Ml := Round 



The number m\ = m-j must therefore be in the interval [ml, Ml]. 

Next, consider rrij for j G {8, . . . , 7 + k}. For a 
Therefore, 



7 + x we have pj-j- < x < p 



rrij 



7E x + £ j > 



[ZTk+l 
' k 



1 



and hence rrij > mx 



way we see that rrij < Mx := 



i 



1 = Ceiling I — f=^-^ J — 1. In the same 



i 1 

~Tk 
fc+i 



+ 1 = Floor 



i 1 

~Tk 
fc+i , 



+ 1. The numbers 



ms, . . . , m7_|_fc must therefore be in the interval [mx, Mx]. 

Since a 6 ]7^^, 7^[, we have £(m) = l{a) > 7+fe+l. Lemma 2.1.8 (i) applied to the 

second block shows that \rn-j — (mg + • • • + m-j + k) — "^r+fc+i ) | < Ceiling i^/k + 2J — 1. 
Note that we can assume that mi > mi > . . . m 7+ fc +1 and that m 7+ fc +1 > 1. 

Using this information about m±, 7717+^+1, we build a preliminary list of vec- 
tors m as follows: We first take all possibilities for m±, m-j+k+i into account. 
Since the full vector m solves the Diophantine equation for d, the remaining num- 
bers m-j + k+2-, ■ ■ ■ j m M must solve the Diophantine equation (B.l) with 

7+fc+l 7+fc+l 

a = 3d — 1 — m ii b = d 2 + 1 — mf. 

i=i i=i 
Note that we can assume that a > and b > 0. We then take the list Solutions [a,b,M[ [-1] ] 
of all solutions to (B.l) for which all mj are at most m^^+it an d append each such so- 
lution to (mi, . . . , m-j + k+i). It could be that the only solution is (namely if a = b = 0); 
in this case we remove the entry 0. 

Prelist [k_,d_] := Module [{kk=k,dd=d,u, v, ml ,M1 ,mx,Mx,f ,t , 

PP,M,MM,i=0,j=0,s=l,S,T,K,l,L=O}, 
u = l/(kk+l) ; 
v = 1/kk; 

ml = Round [dd/Sqrt [7+v] ] ; 

Ml = Round [dd/Sqrt [7+u] ] ; 

mx = Floor [dd/Sqrt [7+v] u]-l; 

Mx = Ceiling [dd/Sqrt [7+u] v]+l; 

f = Ceiling [Sqrt [kk+2] -1] ; 

t = -f; 
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PP = P [kk] ; 
While [i <= Ml-ml, 
While [j <= Mx-mx, 
While [s <= 3, 
While [t <= f, 

M = Join[ Table [ml+i, {u,7}] , Table [mx+j , {u,kk}] ]; 

M = M + PP[[s]] ; 

S = Sum[ M[[u]], {u,8,8+kk-l}] ; 

M = Append [M, M [ [7] ] -S+t] ; 

T=l; 

If[ M == Sort [M, Greater] && M[[-l]] > 0, T=l, T=0] ; 

S = sum [M] ; 

A = 3dd-l-S; 

B = dd~2+l-M.M; 

If [ Min[A,B] < 0, T=0] ; 

If [ T==l, 

K = Solutions [A, B, M [ [-1] ]] ; 

i=i; 

While [1 <= Length [K] , 

MM = Join[ M,K[[1]] ] ; 

While [ MM[[-1]] == 0, MM=Drop [MM, -1] ]; 

L = Append [L, MM] ; 

1++ 

] 

]; 

t++] ; 

t=-f ; 
s++] ; 
s=l; 

j=0; 
i++] ; 

Return [{dd, Union [L] }] ] 

Many of the solutions (d;m) in Prelist[k,d] are not obstructive. The next code 
removes most of these solutions: 



InterSol[k_,d_] := Module [{kk=k,dd=d,L,M,T,K={} , i=l ,1 ,rest> , 

L = Prelist[kk,dd] [[2]] ; 
While [i <= Length [L] , 

M = L[[i]]; 

1 = Length [M] ; 

T = 1; 

If [ 1 <= 7 + kk + 2, T=0] ; 
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If [ M[[-2]]-M[[-l]] > 1, T=0 ] ; 
If [ M[[-3]] > M[[-2]] + 1 

&& Abs[ M[[-3]]-M[[-2]]-M[[-l]] ] > 1 , T=0 ] ; 
If [ kk==l && 1 >= 10, 

If [ M[[9]] - M[[10]] > 1 && 

Abs[ M[[8]] - (M[[9]] + M[[10]]) ] > 1, 
T=0 ] ] ; 

rest = Sum[ M[[j]], {j,8+kk,l> ]; 
If[ M[[7+kk]] - rest >= Sqrt [l-kk-6] , T=0 ]; 
If[ T==l, K = Append [K, M] ]; 
i++] ; 
Return [{dd,K}] ] 

Recall that a € ]7^j,7|[, a / Zk, where Zk = [7;k,2]. In particular, £(m) = £(a) > 
7 + k + 2. We then test the very end of m: Since the last block has length at least 2, 
we must have tum-i = tum + 1 or uim-i = rriM in view of Lemma 2.1.7. 

We next exploit Lemma 2.1.8: If mjf-2 > tum-i + 1, then we know that the length 
of the last block is 2, and thm-2 belongs to the before the last block. Lemma 2.1.8 (ii) 
then shows that \m,M-2 — (mM-i + ^m)| < 1- In the next test we apply the same 
lemma to the special situation where k = 1 and where we know that the third block 
has length 1. 

In the last test we apply Lemma 2.1.8 (ii) with j = 7 + k. 
We finally take the union over d < D of the solutions in InterSol [k,D] : 

InterSolLess [k_,D_] := Module [{kk=k,DD=D,LL={>,Q,d=l>, 

While [d <= DD, 

Q = InterSol [kk,d] ; 
If [Length [Q[ [2]]] > 0, 
LL = Append [LL,Q]] ; 
d++] ; 
Return [LL] ] 

REFERENCES 

[1] M. Beck and S. Robins, Computing the continuous discretely. Integer-point enumeration in 
polyhedra. Undergraduate Texts in Mathematics. Springer, New York, 2007. 

[2] P. Biran, Symplectic packing in dimension 4, Geom. Funct. Anal. 7 (1997), 420-437. 

[3] P. Biran, Constructing new ample divisors out of old ones, Duke Math. J. 98 (1999), 113-135. 

[4] P. Biran, From Symplectic Packing to Algebraic Geometry and back, European Congress of 
Mathematics, Vol. II (Barcelona, 2000), 507-524, Progr. Math. 202, Birkhauser, Basel, 2001. 

[5] K. Cieliebak, H. Hofer, J. Latschev and F. Schlenk, Quantitative symplectic geometry, Dy- 
namics, ergodic theory, and geometry, 1-44, Math. Set. Res. Inst. Publ. 54, Cambridge Univ. 
Press, Cambridge, 2007. 

[6] I. Ekeland and H. Hofer, Symplectic topology and Hamiltonian dynamics. II Math. Z. 203 
(1990), 553-567. 

[7] L. Guth, Symplectic embeddings of polydisks, Invent. Math. 172 (2008), 477-489. 



84 



DUSA MCDUFF AND FELIX SCHLENK 



[8] G. H. Hardy and E. M. Wright, An Introduction to the Theory of Numbers, OUP, Oxford 
(1938). 

[9] R. Hind and E. Kerman, New obstructions to symplectic embeddings, arxiv:0906.4296. 
[10] M. Hutchings and C. H. Taubes, Gluing pseudoholomorphic curves along branched covered 

cylinders, I, arXiv:math/0701300, to appear in Journ. Symp. Geom. 
[11] M. Hutchings and C H. Taubes, in preparation. 

[12] Bang-He Li and T.-J. Li, Symplectic genus, minimal genus and diffeomorphisms, Asian 

J. Math. 6 (2002), 123-144. 
[13] T.-J. Li and A. K. Liu, Uniqueness of symplectic canonical class, surface cone and symplectic 

cone of 4-manifolds with b + = 1, J. Differential. Geom. 58 (2001), 331-370. 
[14] D. McDuff, From symplectic deformation to isotopy, Topics in symplectic 4-manifolds (Irvine, 

CA, 1996), 85-99, First Int. Press Lect. Ser., I, Int. Press, Cambridge, MA, 1998. 
[15] D. McDuff, Symplectic embeddings of 4-dimensional ellipsoids, J. Topol. 2 (2009), 1-22. 
[16] D. McDuff, Symplectic embeddings and continued fractions: a survey, arXiv:0908.4387, to 

appear in Journ. Jap. Math. Soc. 
[17] D. McDuff and L. Polterovich, Symplectic packings and algebraic geometry, Invent. Math., 

115 (1994), 405-29. 

[18] Dorothee Miiller, Symplectic embeddings of ellipsoids into polydiscs, PhD thesis, Universite 

de Neuchatel, in preparation. 
[19] P. Seidel, Lectures on four-dimensional Dehn twists, Symplectic 4-manifolds and algebraic 

surfaces, 231-267, Lecture Notes in Math. 1938, Springer, Berlin, 2008. 
[20] E. Opshtein, Maximal symplectic packings in F 2 , Compos. Math. 143 (2007), 1558-1575. 
[21] C. H. Taubes, Embedded contact homology and Seiberg-Witten cohomology, I, 

arXiv:0811.3985. 

(D. McDuff) Department of Mathematics, Barnard College, Columbia University, New 
York, NY 10027-6598, USA. 

E-mail address: dmcduff@barnard.edu 

(F. Schlenk) Institut de Mathematiques, Universite de Neuchatel, Rue Emile Ar- 
gand 11, CP 158, 2009 Neuchatel, Switzerland 
E-mail address: schlenkOunine . ch 



